Preface to the third edition



In the six or seven years which have elapsed since the second edition was published the
importance of optoelectronics as a subject in its own right has continued to grow, and the
applications of optoelectronic devices have increased significantly. Very few courses in
physics or electronic engineering do not now include a discussion of optoelectronics.

Our interpretation of what is meant by optoelectronics has remained unaltered since the 
first edition was published in 1983. That is, we define optoelectronics as the interaction of 
light (in the wavelength range from about 100 nm to 20 µm) with matter in gaseous, liquid
or solid form, and the devices which depend on these interactions. This definition is of course
broader than that adopted by many authors who restrict their discussions to the ways in which
light interacts with semiconductors. This, however, ignores the many important devices that
depend on the behaviour of light in crystals subject to external force fields, and the
majority of lasers.

Typical of the dramatic growth of optoelectronics is the staggering rise in fiber optic com- 
munications. Optical fibers with laser sources have enabled home subscribers to have access 
to an enormous amount of facilities and information, varying from telephone links, through
many video channels, to information databases worldwide via the information superhighway.
The so-called information superhighway has resulted from the development of very
low dispersion fibers coupled with the enormous bandwidth provided by laser sources, and 
the availability of very fast light detectors. Similarly there has been an amazing continua- 
tion in the growth of laser applications, not least in the field of medicine. 

The book was originally written very much with the final year UK undergraduate in mind. 
It has, however, been widely used as an introduction to optoelectronics for postgraduate
students and those in industry, who require a treatment that is not too advanced, but which
nevertheless gives a good introduction to the quantitative aspects of the subject. We see no
reason to change this approach as the book has been used as a standard text by colleagues in
very many institutions worldwide for both undergraduate and postgraduate students. We are
grateful to those who, having used the book, have taken the time to point out minor errors and
to make suggestions for improvements.

This edition then does not aim to cover all aspects of optoelectronics nor to deal with some 
topics in full theoretical rigour, which in many cases would require a formal quantum 
mechanical approach. It aims rather to put special emphasis on the fundamental principles 
which underlie the operation of devices and systems. This, it is anticipated, will enable the 
reader to appreciate the operation of devices not covered here and to understand future devel- 
opments within the subject. 

Optoelectronics relies heavily on the disciplines of optics and solid state physics and we
expect the reader to have some background knowledge of these subjects. For those with little
experience in these topics we have retained, after careful consideration, the original Chapters
1 and 2 with some modifications, which provide a review of some of the relevant topics in
these areas.

Otherwise in this third edition, given the rapid developments mentioned earlier, we have 
taken the opportunity to update the material covered in all of the other chapters. We have 
also reduced the length of some sections and introduced new sections and topics to reflect 
the changing emphasis within the subject. Thus in Chapter 3 we have increased the empha- 
sis on parametric oscillation, and introduced a section on solitons in Chapter 9. 

The two chapters on lasers, Chapters 5 and 6, have been updated to include, for example,
sections on the free electron laser, quantum well lasers, vertical emitting lasers and
superluminescent diodes, fiber lasers and parametric lasers, together with a section on medical
and industrial applications of lasers.

In Chapter 7 there is a reduced emphasis on thermal detectors but an increased empha- 
sis on junction detectors, especially those operating in the near IR and on high speed devices. 

Chapter 8 on optical fibers has been updated to emphasize the increased importance of
single mode fibers and very low dispersion (high bandwidth) fibers. Fiber manufacture is 
updated and the production of long wavelength fibers discussed. 

In Chapter 9 wavelength division multiplexing, optical amplifiers, solitons and coherent 
systems are all introduced, along with a consideration of the performance of systems which
depend on these topics. The systems covered include local area networks (LANs) and world- 
wide telephone links. The section on integrated optoelectronics has been updated and 
expanded. 

Finally, while the optical fiber sensor which rivals the ubiquitous thermocouple does not 
yet appear to have been developed, there is a continued high level of interest in optical fiber 
sensors and some recent developments, particularly on distributed systems, have been 
included in Chapter 10. 

Many readers of the previous editions have indicated the usefulness of the in-chapter
examples and we have retained these and introduced several more. These, we believe, give
students a feel for the subject and for the orders of magnitude of the parameters involved, and
provide a better understanding of the text. We have also included more end-of-chapter
problems. A teachers' manual containing solutions to these problems can be obtained from the
publisher. The book again uses SI units throughout with the exception of the occasional use
of the electron volt (eV).

Light



In discussing the various topics which we have brought together in this text under the title
optoelectronics, of necessity we rely heavily on the basic physics of light, matter and their
interactions. In this and the next chapter we describe rather briefly those concepts of optics
and solid state physics which are fundamental to optoelectronics. The reader may be familiar
with much of the content of these two chapters, though those who have not recently studied
optics or solid state physics may find them useful. For a more detailed development of the
topics included, the reader is referred to the many excellent texts on these subjects, a
selection of which is given in refs 1.1 and 2.1.

In this chapter we shall describe phenomena such as polarization, diffraction, interference 
and coherence; we have assumed that the basic ideas of the reflection and refraction of light 
and geometrical optics are completely familiar to the reader, though one or two results of 
geometrical optics are included for convenience. In this context it is worth noting that the 
term 'light' is taken to include the ultraviolet and near-infrared regions as well as the visible 
region of the spectrum. 



1.1 Nature of light



During the seventeenth century two emission theories on the nature of light were developed, 
the wave theory of Hooke and Huygens and the corpuscular theory of Newton. Subsequent 
observations by Young, Malus, Euler and others lent support to the wave theory. Then in 
1864 Maxwell combined the equations of electromagnetism in a general form and showed
that they suggest the existence of transverse electromagnetic waves. The speed of
propagation in free space of these waves was given by

$$c = \frac{1}{\sqrt{\mu_0 \varepsilon_0}} \qquad (1.1)$$



where $\mu_0$ and $\varepsilon_0$ are the permeability and permittivity of free space, respectively.
Substitution of the experimentally determined values of $\mu_0$ and $\varepsilon_0$ yielded a value for $c$ in
very close agreement with the value of the speed of light in vacuo measured independently.
Maxwell therefore proposed that light was an electromagnetic wave having a speed $c$ of
approximately $3 \times 10^{8}$ m s$^{-1}$, a frequency of some $5 \times 10^{14}$ Hz and a wavelength of about
500 nm. Maxwell's theory suggested the possibility of producing electromagnetic waves
with a wide range of frequencies (or wavelengths). In 1887 Hertz succeeded in generating
non-visible electromagnetic waves, with a wavelength of the order of 10 m, by discharging
an induction coil across a spark gap thereby setting up oscillating electric and
magnetic fields. Visible light and Hertzian waves are part of the electromagnetic
spectrum which, as we can see from Table 1.1, extends approximately over the wavelength
range of 1.0 pm to 100 km. The wave theory thus became the accepted theory of light.
However, while the wave theory, as we shall see below, provides an explanation of optical
phenomena such as interference and diffraction, it fails completely when applied to
situations where energy is exchanged, such as in the emission and absorption of light and the
photoelectric effect. The photoelectric effect, which is the emission of electrons from the
surfaces of solids when irradiated, was explained by Einstein in 1905. He suggested that
the energy of a light beam is not spread evenly but is concentrated in certain regions, which
propagate like particles. These 'particles' of energy subsequently became known as photons
(G. N. Lewis, 1926).

TABLE 1.1 Electromagnetic spectrum

Einstein was led to the concept of photons by the work of Planck on the emission of light
from hot bodies. Planck found that the observations indicated that light energy is emitted
in multiples of a certain minimum energy unit. The size of the unit, which is called a quantum,
depends on the frequency $\nu$ of the radiation and is given by

$$E = h\nu \qquad (1.2)$$

where $h$ is Planck's constant. Planck's hypothesis did not require that the energy should be
emitted in localized bundles and it could, with difficulty, be reconciled with the
electromagnetic wave theory. When Einstein showed, however, that it seemed necessary to assume
the concentration of energy travelling through space as particles, a wave solution was
excluded. Thus we have a particle theory also: light apparently has a dual nature!
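
As a rough numerical illustration of eq. (1.2), the following sketch evaluates the energy of
one quantum for a few wavelengths in the range covered by this text. It assumes that the
frequency is obtained from the vacuum wavelength through $\nu = c/\lambda_0$; the function names and
the choice of wavelengths are illustrative only.

```python
# Photon (quantum) energy from eq. (1.2), E = h * nu.
# Assumption: the frequency is obtained from the vacuum wavelength via nu = c / lambda_0.

H_PLANCK = 6.62607015e-34       # Planck's constant in J s
C_LIGHT = 2.99792458e8          # speed of light in vacuum in m/s
JOULE_PER_EV = 1.602176634e-19  # one electron volt expressed in joules


def photon_energy_joule(wavelength_m):
    """Energy of one quantum (in joules) for a vacuum wavelength given in metres."""
    frequency_hz = C_LIGHT / wavelength_m
    return H_PLANCK * frequency_hz


def photon_energy_ev(wavelength_m):
    """The same energy expressed in electron volts."""
    return photon_energy_joule(wavelength_m) / JOULE_PER_EV


for wavelength_nm in (100, 500, 20000):
    energy_ev = photon_energy_ev(wavelength_nm * 1e-9)
    print(f"{wavelength_nm:>6} nm -> {energy_ev:.3f} eV")
```

For green light at 500 nm the quantum energy comes out at roughly 2.5 eV, which is why the
electron volt is a convenient unit when dealing with photons.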

The two theories of light are not in conflict but rather they are complementary. For our 
purposes it is sufficient to accept that in many experiments, especially those involving the 
exchange of energy, the particle (photon or quantum) nature of light dominates the wave 
nature. On the other hand, for experiments involving interference or diffraction, where light 
interacts with light, the wave nature dominates. 



1.2 Wave nature of light

Light as an electromagnetic wave is characterized by a combination of time-varying
electric ($\mathcal{E}$) and magnetic ($\mathcal{H}$) fields propagating through space (see e.g. ref. 1.1b, Chapters
19-21; 1.1c, Chapter 1). Maxwell showed that both these fields satisfy the same partial
differential equation:

$$\nabla^2 A = \frac{1}{c^2} \frac{\partial^2 A}{\partial t^2} \qquad (1.3)$$

where $A$ stands for either field. This is called the wave equation; it is encountered in many
different kinds of physical phenomena such as mechanical vibrations of a string or in a rod.
The implication of eq. (1.3) is that changes in the fields propagate through space with a
speed $c$, the speed of light. The frequency of oscillation of the fields, $\nu$, and their
wavelength in vacuum, $\lambda_0$, are related by

$$c = \nu \lambda_0 \qquad (1.4)$$

In any other medium the speed of propagation is given by

$$v = \frac{c}{n} = \nu \lambda \qquad (1.5)$$

where $n$ is the refractive index of the medium and $\lambda$ is the wavelength in the medium (later
in the text we often drop the subscript from the vacuum wavelength $\lambda_0$ to simplify the
notation). $n$ is given by

$$n = \sqrt{\mu_r \varepsilon_r} \qquad (1.5a)$$

where $\mu_r$ and $\varepsilon_r$ are the relative permeability and relative permittivity of the medium
respectively.
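
The relations (1.4), (1.5) and (1.5a) can be tried out numerically with a short sketch such as
the one below. It assumes a non-magnetic medium ($\mu_r = 1$) with $\varepsilon_r = 2.25$, chosen only so that
$n = 1.5$, and it uses the fact that the frequency is unchanged on entering the medium, so that
$\lambda = \lambda_0/n$. The function names are illustrative.

```python
import math

C_LIGHT = 2.99792458e8  # speed of light in vacuum in m/s


def refractive_index(mu_r, eps_r):
    """Refractive index from eq. (1.5a), n = sqrt(mu_r * eps_r)."""
    return math.sqrt(mu_r * eps_r)


def speed_in_medium(n):
    """Speed of propagation from eq. (1.5), v = c / n."""
    return C_LIGHT / n


def wavelength_in_medium(vacuum_wavelength_m, n):
    """Wavelength in the medium; the frequency is unchanged, so lambda = lambda_0 / n."""
    return vacuum_wavelength_m / n


# Assumed example: a non-magnetic medium with relative permittivity 2.25.
n = refractive_index(mu_r=1.0, eps_r=2.25)
print(f"n = {n:.2f}")
print(f"v = {speed_in_medium(n):.3e} m/s")
print(f"lambda = {wavelength_in_medium(600e-9, n) * 1e9:.1f} nm for lambda_0 = 600 nm")
```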

The electric and magnetic fields vibrate perpendicularly to one another and perpendicularly 
to the direction of propagation as illustrated in Fig. 1.1: that is, light waves are transverse waves. 
In describing optical phenomena we often omit the magnetic field vector. This simplifies dia- 
grams and mathematical descriptions but we should always remember that there is also a mag- 
netic field component which behaves in a similar way to the electric field component. 

The simplest waves are sinusoidal waves, which can be expressed mathematically by the
equation

$$\mathcal{E}(x, t) = \mathcal{E}_0 \cos(\omega t - kx + \phi) \qquad (1.6)$$

where $\mathcal{E}$ is the value of the electric field at the point $x$ at time $t$, $\mathcal{E}_0$ is the amplitude of the
wave, $\omega$ is the angular frequency ($\omega = 2\pi\nu$), $k$ is the wavenumber ($k = 2\pi/\lambda$) and $\phi$ is the phase
constant. The term $(\omega t - kx + \phi)$ is the phase of the wave. Equation (1.6), which describes
a perfectly monochromatic plane wave of infinite extent propagating in the positive $x$
direction, is a solution of the wave equation (1.3).

FIG. 1.1 Electromagnetic wave: the electric vector ($\mathcal{E}$) and the magnetic vector ($\mathcal{H}$) vibrate in orthogonal
planes and perpendicularly to the direction of propagation.

We can represent eq. (1.6) diagrammatically by plotting $\mathcal{E}$ as a function of either $x$ or $t$
as shown in Figs 1.2(a) and (b), where we have taken $\mathcal{E} = \mathcal{E}_0$ at $x$ and $t$ equal to zero so that
$\phi = 0$. Figure 1.2(a) shows the variation of the electric field with distance at a given instant
of time. If, as a representative time, we take $t$ equal to zero, then the spatial variation of the
electric field is given by

$$\mathcal{E} = \mathcal{E}_0 \cos kx \qquad (1.6a)$$

Similarly Fig. 1.2(b) shows the variation of the electric field as a function of time at some
specific location in space. If we take $x$ equal to zero then the temporal variation of electric
field is given by

$$\mathcal{E} = \mathcal{E}_0 \cos \omega t \qquad (1.6b)$$

Equations (1.6) can be written in a variety of equivalent forms using the relationships between
$\nu$, $\omega$, $\lambda$, $k$ and $c$ already given. We note also that the time for one cycle is the period $T$ ($T = 1/\nu$)
as shown in Fig. 1.2(b).

If the value of $\mathcal{E}$ at $x = 0$, $t = 0$ is not $\mathcal{E}_0$ then we must include the arbitrary phase constant
$\phi$. Equations (1.6) can also be expressed using a sine rather than a cosine function, or
alternatively using complex exponentials.

In the plane waves described above and in other forms of wave there are surfaces of
constant phase, which are referred to as wave surfaces or wavefronts. As time elapses the
wavefronts move through space with a velocity $v$ given by

$$v = \frac{\omega}{k} = \nu \lambda \qquad (1.7)$$

which is called the phase velocity. As it is impossible in practice to produce perfectly
monochromatic waves we often have the situation where a group of waves of closely similar
wavelength is moving such that their resultant forms a packet. This packet moves with the group
velocity $v_g$. A discussion of this phenomenon based on the combination of two waves of
slightly different frequencies moving together, which is illustrated in Fig. 1.3, shows that
the group velocity is given by (see Problem 1.2)

$$v_g = \frac{\partial \omega}{\partial k} \qquad (1.8)$$

FIG. 1.2 Electric field ($\mathcal{E}$) of an electromagnetic wave plotted as a function of (a) the spatial coordinate $x$
and (b) the time $t$.
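
The two-wave argument behind eq. (1.8) can be checked numerically. The sketch below adds two
unit-amplitude waves of nearly equal wavenumber and compares the result with the product of a
slowly varying envelope and a rapidly varying carrier. The dispersion relation $\omega = a k^2$ used
here is purely hypothetical (it is not taken from the text) and is chosen only because its
phase and group velocities differ in an obvious way.

```python
import math


def omega(k, a=1.0):
    """Hypothetical dispersion relation omega = a * k**2 (an assumption for illustration)."""
    return a * k * k


def phase_velocity(k):
    """Phase velocity from eq. (1.7), v = omega / k."""
    return omega(k) / k


def group_velocity_two_waves(k1, k2):
    """Envelope speed for two waves: the finite-difference form of eq. (1.8)."""
    return (omega(k2) - omega(k1)) / (k2 - k1)


def two_wave_sum(x, t, k1, k2):
    """Direct sum of two unit-amplitude waves travelling in the +x direction."""
    return math.cos(omega(k1) * t - k1 * x) + math.cos(omega(k2) * t - k2 * x)


def envelope_times_carrier(x, t, k1, k2):
    """The same sum written as a slow envelope multiplied by a fast carrier."""
    d_omega, d_k = omega(k2) - omega(k1), k2 - k1
    mean_omega, mean_k = (omega(k1) + omega(k2)) / 2, (k1 + k2) / 2
    envelope = 2 * math.cos((d_omega * t - d_k * x) / 2)
    carrier = math.cos(mean_omega * t - mean_k * x)
    return envelope * carrier


k1, k2 = 10.0, 10.2
print("phase velocity at mean k:", phase_velocity((k1 + k2) / 2))
print("group velocity of the pair:", group_velocity_two_waves(k1, k2))
```

The envelope depends on $x$ and $t$ only through $(\Delta\omega\, t - \Delta k\, x)$, so it travels at $\Delta\omega/\Delta k$, which tends
to $\partial\omega/\partial k$ as the two frequencies approach one another.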



Equations (1.6) represent plane waves moving along the $x$ axis; we can generalize our
mathematical description to include plane waves moving in arbitrary directions. Such a wave
can be characterized by a wavevector $\mathbf{k}$ where $|\mathbf{k}| = 2\pi/\lambda$ and eq. (1.6) becomes

$$\mathcal{E}(x, y, z, t) = \mathcal{E}_0 \cos(\omega t - \mathbf{k} \cdot \mathbf{r} + \phi) \qquad (1.9)$$

where $\mathbf{r}$ is a vector from the origin to the point $(x, y, z)$. Thus, for example, if we have a plane
wave propagating in a direction making an angle $\theta$ with the $x$ axis, with its wavefronts normal
to the $xy$ plane as shown in Fig. 1.4, we can write

$$\mathbf{k} = \mathbf{i} k_x + \mathbf{j} k_y \qquad (1.10)$$

and

$$\mathbf{r} = \mathbf{i} x + \mathbf{j} y \qquad (1.11)$$

where $\mathbf{i}$ and $\mathbf{j}$ are unit vectors in the $x$ and $y$ directions respectively. Combining eqs (1.10)
and (1.11) we have

$$\mathbf{k} \cdot \mathbf{r} = x k_x + y k_y = x k \cos\theta + y k \sin\theta$$

Hence we can write eq. (1.9) in this case as

$$\mathcal{E}(x, y, t) = \mathcal{E}_0 \cos(\omega t - x k \cos\theta - y k \sin\theta + \phi) \qquad (1.9a)$$

FIG. 1.3 The concept of a wave group or wave packet is illustrated by the combination of two progressive
waves of nearly equal frequency. The envelope of the group moves with the group velocity $v_g$.

FIG. 1.4 Plane wave with its propagation vector $\mathbf{k}$ in the $xy$ plane. The components of the propagation
vector are $k_x = |\mathbf{k}| \cos\theta$ and $k_y = |\mathbf{k}| \sin\theta$.

An equally important concept is that of spherical waves which, we can imagine, are
generated by a point source of light. If such a source is located in an isotropic medium it will
radiate uniformly in all directions; the wavefronts are thus a series of concentric spherical
shells. We can describe this situation by

$$\mathcal{E} = \frac{\mathcal{A}}{r} \cos(\omega t - \mathbf{k} \cdot \mathbf{r})$$

where the constant $\mathcal{A}$ is known as the source strength. The factor $1/r$ in the amplitude term
accounts for the decrease in amplitude of the wave as it propagates further and further from
the source. As the irradiance$^1$ is proportional to the square of the amplitude, there is an
inverse-square-law decrease in irradiance. If the medium in which the source is located is
anisotropic, then the wave surfaces are no longer spheres; their shapes depend on the speed
of propagation in different directions. We shall return to this point in section 3.2.

The path followed by the normal to the wavefronts or wave surfaces is often described
by the term light rays, and we may think of a beam of light as comprising a very large bundle
of such light rays moving in the same direction. Similarly we can easily envisage light rays
diverging uniformly outwards in all directions from a point source of light situated in a
homogeneous region. The concept of light rays is useful when analyzing, to a first
approximation, the behaviour of light as it passes through a system of optical components such as
lenses and mirrors; see section 1.3.

1.2.1 Polarization 

If the electric field vector of an electromagnetic wave propagating in free space vibrates in
a specific plane, the wave is said to be plane polarized. Any real beam of light comprises
many individual waves and in general the planes of vibration of their electric fields will be
randomly orientated. Such a beam of light is unpolarized and the resultant electric field vector
changes orientation randomly in time. It is possible, however, to have light beams
characterized by highly orientated electric fields and such light is referred to as being polarized.
The simplest form of polarization is plane polarized light, which is similar to the single wave
shown in Fig. 1.1. Other forms of polarization are discussed in section 3.1.

Light can be polarized in a number of different ways and here we consider two, namely
polarization by reflection and by absorption. When unpolarized light is incident on a
material surface as shown in Fig. 1.5(a) we find that the light with its polarization vector
perpendicular to the plane of incidence (denoted by -•-•→) is preferentially reflected in
comparison with light polarized parallel to the plane of incidence (denoted by -|-|→). We
can resolve the electric field vector of each wave into components parallel and
perpendicular to any convenient direction: here we choose the plane of incidence. Thus by
symmetry we may think of unpolarized light as comprising two equal plane polarized components
with orthogonal orientations.

FIG. 1.5 In (a), light reflected from the interface between two media of refractive index $n_1$ and $n_2$ is
partially plane polarized (there is less of the parallel component -|-|→ than the perpendicular component
-•-•→), while in (b) for incidence at the Brewster angle $\theta_B$, the reflected light is completely plane polarized.

We find further that the reflectance$^2$ of the surface for the perpendicular and parallel
components varies as a function of the angle of incidence as shown in Fig. 1.6. In particular we
note that for the parallel component the reflectance is zero at the specific angle of incidence
$\theta = \theta_B$. For this angle of incidence, which is called the Brewster angle, all of the parallel
component is transmitted. It is found also that for incidence at the Brewster angle the reflected
and refracted (transmitted) rays are perpendicular to one another as illustrated in Fig. 1.5(b).
Thus, using Snell's law for the refraction of light at the interface between two media of
refractive indices $n_1$ and $n_2$, that is

$$n_1 \sin\theta_1 = n_2 \sin\theta_2 \qquad (1.12)$$

where $\theta_1$ and $\theta_2$ are the angles of incidence and refraction respectively, we have (since
$\theta_2 = 90° - \theta_B$ when $\theta_1 = \theta_B$)

$$n_1 \sin\theta_B = n_2 \cos\theta_B$$

or

$$\tan\theta_B = \frac{n_2}{n_1} \qquad (1.13)$$

Equation (1.13) is known as Brewster's law. One way of polarizing light is simply to pass
it through a series of glass plates orientated at the Brewster angle. Then at each surface, some
of the light polarized perpendicularly to the plane of incidence is reflected, as shown in Fig.
1.7, while all of the parallel component is transmitted. After passing through about six plates,
the transmitted light is highly plane polarized. We shall see in Chapter 5 that lasers often
include surfaces inclined at the Brewster angle to minimize optical losses for one
particular orientation of polarization of the light passing through the surfaces. Such lasers thus emit
plane polarized light.

FIG. 1.6 Reflectance as a function of angle of incidence for light polarized parallel (∥) and perpendicular
(⊥) to the plane of incidence, incident on the interface from (a) the less optically dense side and (b) the
more optically dense side. The values of $\theta_B$ = 56.3° (a) and 33.7° (b), and $\theta_c$ = 41.8°, are for an air
($n$ = 1)/glass ($n$ = 1.5) combination.

FIG. 1.7 Schematic illustration of the production of plane polarized light by reflection using a 'pile of
plates'. The light is incident on the plates at the Brewster angle so that with about six plates the emergent
light is plane polarized parallel to the plane of incidence. For simplicity the effects of refraction at the
surfaces of the plate are ignored.



EXAMPLE 1.1 Brewster angle

We may calculate the Brewster angle for glass given that its refractive index in air is
1.5.

From eq. (1.13) and assuming that the glass has an interface with air ($n = 1$) we have
$\theta_B = \tan^{-1} 1.5 = 56.3°$ (see Fig. 1.6).
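
The calculation in Example 1.1 is easily repeated for other pairs of media. The sketch below
evaluates eq. (1.13) and then uses Snell's law, eq. (1.12), to confirm that the refracted ray
is at 90° to the reflected ray when the light is incident at the Brewster angle. The function
names are illustrative, and the indices 1 and 1.5 are those of the example.

```python
import math


def brewster_angle_deg(n1, n2):
    """Brewster angle from eq. (1.13) for light travelling from medium n1 into medium n2."""
    return math.degrees(math.atan2(n2, n1))


def refraction_angle_deg(n1, n2, incidence_deg):
    """Angle of refraction from Snell's law, eq. (1.12)."""
    sin_theta_2 = n1 * math.sin(math.radians(incidence_deg)) / n2
    return math.degrees(math.asin(sin_theta_2))


theta_b = brewster_angle_deg(1.0, 1.5)               # incidence from the air side
theta_2 = refraction_angle_deg(1.0, 1.5, theta_b)    # corresponding angle inside the glass
print(f"Brewster angle, air to glass: {theta_b:.1f} degrees")
print(f"Angle of refraction: {theta_2:.1f} degrees, sum = {theta_b + theta_2:.1f} degrees")
print(f"Brewster angle, glass to air: {brewster_angle_deg(1.5, 1.0):.1f} degrees")
```

Because the angle of reflection equals the angle of incidence, a sum of 90° for $\theta_B$ and $\theta_2$
means that the reflected and refracted rays are perpendicular, as stated in the text.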



In Fig. 1.5, and in eqs (1.12) and (1.13), we have tacitly assumed that $n_2 > n_1$, but for
incidence at a boundary separating a more optically dense medium from a less optically
dense medium the reverse is obviously the case. Hence the Brewster angle 'in air' is
different from the Brewster angle 'in the material'; for air ($n = 1$) to glass ($n = 1.5$) the two
angles are $\tan^{-1} 1.5$ (or 56.3°) and $\tan^{-1} 0.67$ (or 33.7°). The reflectance of light polarized
parallel or perpendicular to the plane of incidence varies with the angle of incidence in the
more dense medium (i.e. $n_1 > n_2$) as shown in Fig. 1.6(b). We see here that for angles greater
than a certain angle $\theta_c$, the critical angle, there is no transmitted light, that is total internal
reflection has taken place.

At $\theta = \theta_c$ the angle of refraction (in the less dense medium) is 90°, so from eq. (1.12) we
can write

$$n_1 \sin\theta_c = n_2 \sin 90°$$

or

$$\theta_c = \sin^{-1}(n_2/n_1) \qquad (1.14)$$



Total internal reflection is discussed in more detail in relation to the propagation of light 
along optical waveguides in Chapter 8. 
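
Equation (1.14) can be evaluated with a few lines of code. The sketch below is a minimal
illustration for the glass/air combination used in Fig. 1.6; the function names are
hypothetical, and the check that $n_1 > n_2$ simply reflects the fact that total internal
reflection occurs only for incidence from the more optically dense side.

```python
import math


def critical_angle_deg(n1, n2):
    """Critical angle from eq. (1.14) for incidence in medium n1 on a boundary with medium n2."""
    if n1 <= n2:
        raise ValueError("total internal reflection requires n1 > n2")
    return math.degrees(math.asin(n2 / n1))


def is_totally_reflected(n1, n2, incidence_deg):
    """True when the angle of incidence exceeds the critical angle."""
    return n1 > n2 and incidence_deg > critical_angle_deg(n1, n2)


print(f"Critical angle, glass to air: {critical_angle_deg(1.5, 1.0):.1f} degrees")
print("45 degrees totally reflected:", is_totally_reflected(1.5, 1.0, 45.0))
print("30 degrees totally reflected:", is_totally_reflected(1.5, 1.0, 30.0))
```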

Light can also be polarized by selective absorption in various materials such as tourma- 
line, which occurs naturally, and 'polaroid'. Polaroid consists of a plastic sheet of polyvinyl 
alcohol impregnated with iodine. Molecules of iodine polyvinyl alcohol are orientated into 
long chains by stretching the sheet. This material then transmits about 80% of the light polar- 
ized perpendicular to the chains of molecules but less than 1% of the light polarized paral- 
lel to these chains. Evidently the chains of molecules interact with this component and 
effectively absorb it. Polaroid is widely used for producing plane polarized light in optical 
systems, but various polarizing prisms such as the Nicol prism and the Glan-Thompson
prism are more efficient in terms of the degree of polarization (which is defined as the ratio
of the irradiance of the polarized component to the total irradiance).

If the light incident on such a linear polarizing device is already plane polarized, then
the amount of light transmitted depends on the angle $\theta$ between the plane of polarization of
the incident light and the plane of polarization of the light transmitted by the polarizer (often
called the polarizing axis). We can easily show (see Problem 1.4) that if $\mathcal{E}_0$ is the amplitude
of the incident light then the transmitted amplitude is $\mathcal{E}_0 \cos\theta$ and the irradiance of the
transmitted light is given by$^3$

$$I = \mathcal{E}_0^2 \cos^2\theta = I_0 \cos^2\theta \qquad (1.15)$$

This relationship is known as Malus's law.
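
A short numerical sketch of eq. (1.15) is given below. It assumes an ideal polarizer (no
absorption of the transmitted component) and plane polarized incident light of unit
irradiance; the function name is illustrative.

```python
import math


def transmitted_irradiance(incident_irradiance, angle_deg):
    """Malus's law, eq. (1.15): I = I_0 * cos^2(theta) for an ideal linear polarizer."""
    return incident_irradiance * math.cos(math.radians(angle_deg)) ** 2


for angle in (0, 30, 45, 60, 90):
    print(f"theta = {angle:>2} degrees -> I / I_0 = {transmitted_irradiance(1.0, angle):.3f}")
```

The transmitted irradiance falls to one half at 45° and to zero when the plane of polarization
of the incident light is perpendicular to the polarizing axis.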

1.2.2 Principle of superposition

In discussing optical phenomena we are often confronted with the problem of finding the
resultant produced when two or more waves act together at a point in space. It is found that
the principle of superposition applies. This states that 'the resultant electric field at a given
place and time due to the simultaneous action of two or more sinusoidal waves is the
algebraic sum of the electric fields of the individual waves'. That is,

$$\mathcal{E} = \mathcal{E}_1 + \mathcal{E}_2 + \mathcal{E}_3 + \ldots$$

where $\mathcal{E}_1$, $\mathcal{E}_2$, $\mathcal{E}_3$, ... are the electric fields of the individual waves at the specified time and place.
The summation can be carried out in several different ways, the choice of method for a
particular problem being a matter of mathematical convenience (ref. 1.2).

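Let us consider the simple case of the superposition of two waves of the same frequency
propagating in the same direction given by

$$\mathcal{E}_1 = \mathcal{E}_{01} \sin(\omega t - kx + \phi_1)$$

$$\mathcal{E}_2 = \mathcal{E}_{02} \sin(\omega t - kx + \phi_2) \qquad (1.16)$$

The resultant is

$$\mathcal{E} = \mathcal{E}_{01} \sin(\omega t - kx + \phi_1) + \mathcal{E}_{02} \sin(\omega t - kx + \phi_2)$$

which may be written as

$$\mathcal{E} = (\mathcal{E}_{01} \cos\phi_1 + \mathcal{E}_{02} \cos\phi_2) \sin(\omega t - kx) + (\mathcal{E}_{01} \sin\phi_1 + \mathcal{E}_{02} \sin\phi_2) \cos(\omega t - kx)$$

This is identical with

$$\mathcal{E} = \mathcal{E}_0 \sin(\omega t - kx + \phi) \qquad (1.17)$$

provided that

$$\mathcal{E}_0^2 = (\mathcal{E}_{01} \cos\phi_1 + \mathcal{E}_{02} \cos\phi_2)^2 + (\mathcal{E}_{01} \sin\phi_1 + \mathcal{E}_{02} \sin\phi_2)^2$$

or

$$\mathcal{E}_0^2 = \mathcal{E}_{01}^2 + \mathcal{E}_{02}^2 + 2 \mathcal{E}_{01} \mathcal{E}_{02} \cos(\phi_2 - \phi_1) \qquad (1.17a)$$

and

$$\tan\phi = \frac{\mathcal{E}_{01} \sin\phi_1 + \mathcal{E}_{02} \sin\phi_2}{\mathcal{E}_{01} \cos\phi_1 + \mathcal{E}_{02} \cos\phi_2} \qquad (1.17b)$$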

The form of eq. (1.17) implies that the resultant of adding two sinusoidal waves of the
same frequency is itself a sinusoidal wave with the same frequency as the original waves
given by eqs (1.16). The resultant is, of course, a solution of the wave equation (1.3). By
repeated application of this process we can show that the resultant of any number of
sinusoidal waves of the same frequency is a sinusoidal wave of that frequency. In this case the
resultant is given by eq. (1.17), but now we have

$$\mathcal{E}_0^2 = \left( \sum_i \mathcal{E}_{0i} \cos\phi_i \right)^2 + \left( \sum_i \mathcal{E}_{0i} \sin\phi_i \right)^2$$

or

$$\mathcal{E}_0^2 = \sum_i \mathcal{E}_{0i}^2 + \sum_i \sum_{j \neq i} \mathcal{E}_{0i} \mathcal{E}_{0j} \cos(\phi_j - \phi_i) \qquad (1.18)$$

and

$$\tan\phi = \frac{\sum_i \mathcal{E}_{0i} \sin\phi_i}{\sum_i \mathcal{E}_{0i} \cos\phi_i}$$

Now if the original light waves are from completely independent sources, including
separated regions of an extended source, the phase difference $(\phi_j - \phi_i)$ in eq. (1.18) will vary in
a random way such that the average value of $\cos(\phi_j - \phi_i)$ is zero. That is, for every possible
positive value of phase difference there is a corresponding negative value. Therefore, the
irradiance resulting from the superposition of the light from such independent sources is given
by

$$I = \mathcal{E}_0^2 = \sum_i \mathcal{E}_{0i}^2 \qquad (1.19)$$

The resultant irradiance is seen to be the sum of the irradiances which would be produced 
by the individual sources acting separately. In this case the sources are said to be incoher- 
ent and an area on which the light from such sources falls will be uniformly illuminated. If, 
on the other hand, the waves from the different sources maintain constant phase relation- 
ships they are said to be coherent. When an area is illuminated simultaneously by two or 
more coherent sources, the irradiance usually varies from point to point giving rise to inter- 
ference fringes. 

If we consider a surface illuminated by two coherent sources, we can see that as we move
across the surface the relative distance to the sources changes and the phase difference also
changes. Hence the term $\cos(\phi_2 - \phi_1)$ varies periodically between plus one and minus one
and the irradiance varies from a maximum value $I_{max}$, where

$$I_{max} = \mathcal{E}_0^2 = \mathcal{E}_{01}^2 + \mathcal{E}_{02}^2 + 2 \mathcal{E}_{01} \mathcal{E}_{02} = (\mathcal{E}_{01} + \mathcal{E}_{02})^2 \qquad (1.20)$$

when the waves are in phase and $\cos(\phi_2 - \phi_1) = +1$, to a minimum value $I_{min}$, where

$$I_{min} = \mathcal{E}_{01}^2 + \mathcal{E}_{02}^2 - 2 \mathcal{E}_{01} \mathcal{E}_{02} = (\mathcal{E}_{01} - \mathcal{E}_{02})^2$$

when the waves are exactly out of phase and $\cos(\phi_2 - \phi_1) = -1$. Thus a regular system of
alternate bright and dark interference fringes is formed over the surface. Extending the argument
to a large number of coherent sources we can see from eq. (1.20) that the resulting
irradiance at any point where all the waves are in phase is

$$I_{max} = \left( \sum_i \mathcal{E}_{0i} \right)^2 \qquad (1.21)$$

At a nearby point the resultant irradiance may be very small, or indeed zero, depending on
the relative magnitudes of the terms $\mathcal{E}_{0i}$. The interaction of waves from coherent sources thus
alters the spatial distribution of energy without altering the total amount. The topics of
interference and coherence will be pursued further in the next section and section 6.5.1.3
respectively. The superposition of polarized waves is considered in section 3.1.

EXAMPLE 1.2 Maximum irradiance of several coherent and incoherent sources

Here we compare the maximum irradiance resulting from the superposition of equal
contributions $I$ from four (a) coherent sources and (b) incoherent sources.

(a) From eq. (1.21) we have $I_{max} = n^2 I = 16I$, while (b) from eq. (1.19) we have
$I_{max} = nI = 4I$.
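
The two results of Example 1.2 can be reproduced numerically. In the sketch below the coherent
case uses eq. (1.21) and the incoherent case uses eq. (1.19); as a further check, the
incoherent result is also estimated by averaging the squared resultant amplitude over many sets
of random phases. The random-phase average is an illustrative device assumed here, not a
procedure taken from the text, and the seed and number of trials are arbitrary.

```python
import math
import random


def coherent_max_irradiance(amplitudes):
    """Eq. (1.21): all waves in phase, so the amplitudes add before squaring."""
    return sum(amplitudes) ** 2


def incoherent_irradiance(amplitudes):
    """Eq. (1.19): the irradiances of the individual sources add."""
    return sum(a * a for a in amplitudes)


def random_phase_average(amplitudes, trials=20000, seed=0):
    """Average of E0^2 over randomly chosen phases (a numerical stand-in for incoherence)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in amplitudes]
        cos_sum = sum(a * math.cos(p) for a, p in zip(amplitudes, phases))
        sin_sum = sum(a * math.sin(p) for a, p in zip(amplitudes, phases))
        total += cos_sum ** 2 + sin_sum ** 2
    return total / trials


four_equal_sources = [1.0, 1.0, 1.0, 1.0]   # each source contributes irradiance I = 1
print("coherent maximum:    ", coherent_max_irradiance(four_equal_sources))
print("incoherent sum:      ", incoherent_irradiance(four_equal_sources))
print("random-phase average:", round(random_phase_average(four_equal_sources), 2))
```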



STANDING WAVES 

Let us now consider the superposition of two waves of the same frequency travelling in oppo- 
site directions. The waves may be represented by the equations 

'£, ='£(, sin(w/ - ,tv) 

and 

'& 2 = '£ () .sin(wr + L\) 

where, for convenience, we have taken /,„ ='f „, = /.„ and «>, and <)> 2 = 0. Then I = £,+?■„ 
and hence 

% = 2'<£ () cos he sin tat (1.22) 
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Wave amplitude 




HG. l.a Amplitude profile in a standing wave as a function of distance (N. nodes; AN, antmodes) 



Equation (1.22) represents a standing wave and is illustrated in Fig. 1.8. In such a wave there
is no net transfer of energy in either direction. The energy density in the medium, being
proportional to the square of the amplitude of the motion (i.e. $4\mathcal{E}_0^2 \cos^2 kx$),
varies from a maximum at the antinodes to zero at the nodes. Standing waves of this type may be
set up in optical cavities (or resonators), which consist of a pair of parallel, highly
reflecting mirrors. Such a cavity forms an integral part of any laser.
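The standing-wave result can be verified by adding the two travelling waves directly. This is a small sketch under assumed values (a 500 nm wavelength and an optical frequency of 6 x 10^14 Hz, neither taken from the text); the function and constant names are hypothetical.

```python
import math

K = 2 * math.pi / 500e-9        # assumed wavenumber for a 500 nm wave
OMEGA = 2 * math.pi * 6.0e14    # assumed angular frequency


def counter_propagating_sum(e0, k, omega, x, t):
    """Direct sum of two equal waves travelling in opposite directions."""
    return e0 * math.sin(omega * t - k * x) + e0 * math.sin(omega * t + k * x)


def standing_wave(e0, k, omega, x, t):
    """Closed form of eq. (1.22)."""
    return 2 * e0 * math.cos(k * x) * math.sin(omega * t)


x, t = 1.0e-7, 4.0e-16
print(counter_propagating_sum(1.0, K, OMEGA, x, t))
print(standing_wave(1.0, K, OMEGA, x, t))
```

At a quarter wavelength from the origin (x = 125 nm here) the factor cos kx vanishes, which is a node: the field stays at zero for all t.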



1.2.3 Interference 

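The basic mathematical description of 'two-beam' interference is given by eq. (1.17a); that
is,

$$I = \mathcal{E}^2 = \mathcal{E}_{01}^2 + \mathcal{E}_{02}^2 + 2\mathcal{E}_{01}\mathcal{E}_{02}\cos(\phi_2 - \phi_1)$$

If $\mathcal{E}_{01} = \mathcal{E}_{02}$ then

$$I = 2\mathcal{E}_{01}^2[1 + \cos(\phi_2 - \phi_1)]$$

or

$$I = 4\mathcal{E}_{01}^2 \cos^2\left(\frac{\phi_2 - \phi_1}{2}\right) \qquad (1.23)$$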



Equation (1.23) shows that the irradiance distribution of the fringes is given by a
cosine-squared function. If the contributions from the coherent sources are equal the irradiance
of the fringes varies from $4\mathcal{E}_{01}^2$ to zero as $(\phi_2 - \phi_1)$ varies between
0 and $\pi$. If $\mathcal{E}_{01} \neq \mathcal{E}_{02}$, the resultant irradiance varies
between $(\mathcal{E}_{01} + \mathcal{E}_{02})^2$ and $(\mathcal{E}_{01} - \mathcal{E}_{02})^2$.
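The limits quoted for two-beam interference follow directly from eq. (1.17a). A minimal sketch, with an illustrative function name and arbitrary amplitudes chosen only for demonstration:

```python
import math


def two_beam_irradiance(e01, e02, dphi):
    """Irradiance of two superposed coherent waves with phase difference dphi."""
    return e01 ** 2 + e02 ** 2 + 2 * e01 * e02 * math.cos(dphi)


# Equal amplitudes: the irradiance swings between 4*e01**2 and zero.
print(two_beam_irradiance(1.0, 1.0, 0.0))
print(two_beam_irradiance(1.0, 1.0, math.pi))

# Unequal amplitudes: limits are (e01 + e02)**2 and (e01 - e02)**2.
print(two_beam_irradiance(2.0, 1.0, 0.0))
print(two_beam_irradiance(2.0, 1.0, math.pi))
```

With unequal amplitudes the minima no longer reach zero, so the fringe contrast is reduced.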

To obtain the coherent wave trains required for the observation of interference before the
advent of lasers one had to ensure that:

1. The sets of wave trains were derived from the same small source of light and then brought 
together by different paths. 

2. The differences in path were short enough to ensure at least partial coherence of the wave
trains (i.e. the differences in path were less than the coherence length of the source -
see section 6.5.1.3).

The basic ways of satisfying these requirements and demonstrating interference can be
classified into two groups, namely 'division of wavefront' and 'division of amplitude'. The
classic experiment of Young, in 1802, falls into the former group. In this experiment, which
is illustrated in Fig. 1.9, monochromatic light is passed through a pinhole S so as to
illuminate a screen containing two further identical pinholes or narrow slits placed close
together. The presence of the single pinhole S provides the necessary mutual coherence between
the light beams emerging from the slits $S_1$ and $S_2$. The wavefronts from S intersect $S_1$
and $S_2$ simultaneously so that the light contributions emerging from $S_1$ and $S_2$ are
derived from the same original wavefront and are therefore coherent. These contributions spread
out from $S_1$ and $S_2$ (if these are long, thin slits) as 'cylindrical' wavefronts and
interfere in the region beyond the screen. If a second screen is placed as shown then an
interference pattern consisting of straight line fringes parallel to the slits is observed on it.

To find the irradiance at a given point P it is necessary to find the phase difference $\phi$
between the two sets of waves arriving at P from $S_1$ and $S_2$. This in turn depends on the
path difference $(D_2 - D_1)$ as in general

$$\text{phase difference} = \frac{2\pi}{\lambda_0}\,(\text{optical path difference})$$

or, using eq. (1.5),

$$\phi = \frac{2\pi}{\lambda}(D_2 - D_1)$$

where $D_1$ and $D_2$ are the distances from $S_1$ and $S_2$ to P respectively.

FIG. 1.9 Schematic layout and geometry for a Young's double slit interference experiment.

Bright fringes occur when the phase difference is zero or $\pm 2p\pi$, where $p$ is an integer;
that is, when

$$\frac{2\pi}{\lambda}(D_2 - D_1) = \pm 2p\pi$$

which is equivalent to $|D_2 - D_1| = p\lambda$. Therefore bright fringes occur if the path
difference is an integral number of wavelengths. Similarly, dark fringes occur when
$\phi = \pm(2p + 1)\pi$, or the path difference is an odd integral number of half-wavelengths.
It is left as an exercise for the reader to show, with the parameters given in Fig. 1.9, that
bright fringes occur at point P a distance $y$ from O such that

$$y = \pm\frac{p\lambda x}{H} \qquad (1.24)$$

provided both $y$ and $H$ are small compared with $x$. Here $H$ is the slit separation and $x$
is the distance from the screen containing the slits to the observing screen.
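To give a feel for the magnitudes in eq. (1.24), the sketch below evaluates the bright-fringe positions for assumed values (550 nm light, a slit separation of 0.5 mm and a screen 1 m away); these numbers and the function name are illustrative only.

```python
def bright_fringe_position(order, wavelength, screen_distance, slit_separation):
    """Distance y of the bright fringe of a given order from the centre O."""
    return order * wavelength * screen_distance / slit_separation


wavelength = 550e-9        # m, assumed
screen_distance = 1.0      # m, assumed
slit_separation = 0.5e-3   # m, assumed

for p in range(4):
    y = bright_fringe_position(p, wavelength, screen_distance, slit_separation)
    print(p, y)
```

Adjacent bright fringes are separated by $\lambda x/H$, about 1.1 mm for these values, which is small compared with $x$ as the approximation requires.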

Equation (1.23) suggests that in such interference fringe patterns the irradiance of the fringe
maxima will be equal. This is not the case, however, owing to diffraction effects, which are
discussed in section 1.2.4.

Interference effects involving amplitude division can be observed in thin films or plates
as illustrated in Fig. 1.10. In this case, interference occurs between the light reflected at A
on the front surface of the plate and at B on the rear surface. If the plate has parallel faces
then the two sets of waves from A and C are parallel and a lens must be used to bring them
together to interfere. Using elementary geometry and Snell's law (eq. 1.12) the reader may
show that the optical path difference $(AB + BC)n - AD$ is equal to $2nL\cos\theta_2$ where
$\theta_2$ is the angle of refraction and $L$ is the plate thickness (see Problem 1.7). The
phase difference is then $(2\pi/\lambda_0)(2nL\cos\theta_2)$ and therefore bright fringes occur
when

$$\frac{4\pi nL\cos\theta_2}{\lambda_0} = 2p\pi$$

that is,

$$p\lambda_0 = 2nL\cos\theta_2 \qquad (1.25)$$

where again $\lambda_0$ is the wavelength of the radiation in vacuum.
Likewise, dark fringes occur when

$$(2p + 1)\lambda_0/2 = 2nL\cos\theta_2 \qquad (1.25a)$$

If the plate is optically denser than the surrounding medium, there is a phase change of $\pi$
on reflection at the upper surface, thereby causing the above conditions to be interchanged.
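The thin-film conditions amount to asking how many vacuum wavelengths fit into the path difference $2nL\cos\theta_2$. A brief sketch, assuming a 1 µm plate of index 1.5 viewed at normal incidence with 600 nm light (all illustrative values, as are the function names):

```python
import math


def path_difference(n, thickness, theta2):
    """Optical path difference 2*n*L*cos(theta2) between the two reflected beams."""
    return 2.0 * n * thickness * math.cos(theta2)


def interference_order(n, thickness, theta2, wavelength0):
    """Path difference expressed in vacuum wavelengths."""
    return path_difference(n, thickness, theta2) / wavelength0


order = interference_order(1.5, 1.0e-6, 0.0, 600e-9)
print(order)   # close to 5: an integer, so eq. (1.25) is satisfied
```

An integer order satisfies eq. (1.25) and a half-integer order satisfies eq. (1.25a); which of the two corresponds to a bright fringe depends on whether the phase change of $\pi$ on reflection applies.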
FIG. 1.10 Schematic diagram of interference effects in a thin film or plate of refractive index n.

For a given fringe $p$, $\lambda_0$, $L$ and $n$ are constant and therefore $\theta_2$ must be
constant; the fringes are known as 'fringes of equal inclination'. If the angle of incidence is
not too large and an extended monochromatic source is used then the fringes are seen as a set
of concentric circular rings in the focal plane of the observing lens.

When the optical thickness of the plate is not constant and the optical system is such that
$\theta_2$ is almost constant, the fringes are contours of equal optical thickness $nL$. The situation
may be illustrated by considering a small-angled wedge. If the wedge is uniform the fringes 
are approximately straight lines parallel to the apex of the wedge. Again it is left to the reader
to show that the apex angle $\alpha$ is given by

$$\alpha = \tan\alpha = \frac{\lambda_0}{2nS} \qquad (1.26)$$



where S is the fringe spacing. The fringes are often close together and are conveniently viewed 
with a low power microscope. 



MULTIPLE BEAM INTERFERENCE 

If the reflectances of the surfaces of the plate shown in Fig. 1.10 were increased there would
be many reflected beams to contribute to the interference pattern rather than just the two shown.
In practice the resultant interference pattern is seen more clearly if the transmitted beams
rather than the reflected ones are used, as shown in Fig. 1.11. If the plate has parallel sides
then the multiple beams are parallel and are brought together to interfere in the focal plane of
the lens. The resultant of superposing these beams can be calculated quite easily as the phase
difference from one beam to the next is constant. The phase difference is due to two additional
traversals of the plate plus any phase changes which may occur on reflection at the surfaces of
the plate. If the latter are ignored then the condition for the formation of fringe maxima is
identical with the two-beam case given in eq. (1.25). The irradiance distribution is no longer
the cosine-squared distribution of eq. (1.23), but rather it is given by (ref. 1.3)



$$I = \frac{I_0 T^2}{(1 - R)^2\{1 + [4R/(1 - R)^2]\sin^2(\delta/2)\}} \qquad (1.27)$$



where $I_0$ is the irradiance of the incident beam, $R$ and $T$ are the reflectance and
transmittance respectively of the plate surfaces and $\delta$ is the total phase change between
successive beams. It should be noted that $T = 1 - R$, providing that there is no absorption in
the plates. If $R$ is large, greater than about 0.8 say, then the fringe maxima are very sharp
as shown in Fig. 1.12. Fringes of this type are formed in the Fabry-Perot interferometer. This
instrument includes a pair of highly reflecting surfaces set accurately parallel to one another
forming an optical (or Fabry-Perot) resonator as mentioned above and which is discussed
further in section 5.5. The resolving power of the Fabry-Perot interferometer can be very
high and it is widely used in the accurate measurement of the hyperfine structure of spectral
lines.
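The sharpening of the fringes with increasing reflectance can be seen by evaluating eq. (1.27) directly. The sketch below assumes no absorption unless a transmittance is supplied; the function name and the sample reflectances are illustrative.

```python
import math


def transmitted_irradiance(i0, reflectance, delta, transmittance=None):
    """Multiple-beam transmitted irradiance of eq. (1.27)."""
    if transmittance is None:
        transmittance = 1.0 - reflectance          # assumes no absorption
    f = 4.0 * reflectance / (1.0 - reflectance) ** 2
    denominator = (1.0 - reflectance) ** 2 * (1.0 + f * math.sin(delta / 2.0) ** 2)
    return i0 * transmittance ** 2 / denominator


for r in (0.1, 0.5, 0.9):
    peak = transmitted_irradiance(1.0, r, 0.0)
    trough = transmitted_irradiance(1.0, r, math.pi)
    print(r, peak, trough)
```

Without absorption the maxima always reach the incident irradiance, while the level midway between them falls to $[(1 - R)/(1 + R)]^2$, so a high reflectance leaves narrow bright fringes on a nearly dark background.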
FIG. 1.11 Paths of light rays resulting from multiple reflections from two parallel plates (for
simplicity the rear surfaces of the plates and refraction at the surfaces are not shown).

FIG. 1.12 Schematic diagram showing the variation in the irradiance distribution of multiple
beam interference fringes as a function of mirror reflectance R.



Multiple beam interference effects can also be produced by division of wavefront. This 
is achieved by increasing the number of slits from two, the ultimate being reached in the 
diffraction grating where there is a very large number of equal slits. As before, the condi- 
tion for interference maxima is that the contributions to the total irradiance emerging from 
successive slits should be in phase. It can be shown using elementary theory that, for normal 
incidence, maxima in the interference pattern produced by a diffraction grating occur when 



$$p\lambda = (a + b)\sin\theta = d\sin\theta \qquad (1.28)$$

where $d$ ($= a + b$) is the grating constant, with $a$ and $b$ the width and separation of the
slits respectively and $\theta$ the angle of diffraction as shown in Fig. 1.13. For a more
complete discussion of the theory of the diffraction grating the reader is referred to the texts
given in ref. 1.1.
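Equation (1.28) also limits the number of orders a grating can produce, since $\sin\theta$ cannot exceed unity. A minimal sketch, assuming normal incidence, 550 nm light and a grating of 600 lines per mm (the function name is hypothetical):

```python
import math


def grating_maxima(wavelength, lines_per_mm):
    """List (order, angle in degrees) for every order with p*wavelength/d <= 1."""
    d = 1.0e-3 / lines_per_mm          # grating constant in metres
    maxima = []
    p = 0
    while p * wavelength / d <= 1.0:
        angle = math.degrees(math.asin(p * wavelength / d))
        maxima.append((p, angle))
        p += 1
    return maxima


for order, angle in grating_maxima(550e-9, 600):
    print(order, round(angle, 2))
```

For these values only the zeroth to third orders exist, because a fourth order would require $\sin\theta > 1$.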



1.2.4 Diffraction 

If an opaque object is placed between a source of light and a screen it is found that the shadow
cast on the screen is not perfectly sharp. Some light is present in the dark zone of the
geometrical shadow. Similarly light which emerges from a small aperture or narrow slit is
observed to spread out. This failure of light to travel in straight lines is called diffraction; it
is a natural consequence of the wave nature of light.

The essential features of diffraction can be explained by Huygens' principle, which states 
that: 

The propagation of a light wave can be predicted by assuming that each point on the wave- 
front acts as a source of secondary wavelets which spread out in all directions. The enve- 
lope of these secondary wavelets after a small interval of time is the new wavefront. 
FIG. 1.13 Schematic illustration of the formation of interference maxima by a diffraction
grating. If the incident light is not monochromatic then a first-, second-, ... order
interference fringe forms for each wavelength present at different values of $\theta$ so that
each can therefore be distinguished.

FIG. 1.14 Illustration of the application of Huygens' principle to the propagation of a plane
wavefront (dashed lines). The small circles are typical 'point sources' on the wavefront which
emit secondary wavelets. The wavefront at a later time is the resultant of the secondary
wavelets. The diagram shows (in an exaggerated way) that an initially plane wavefront will
diverge as it propagates.
In effect the spherical wavelets emitted from the point sources are summed using the principle
of superposition; the result of the summation is equivalent to the wavefront at a later
time. The propagation of a plane wave according to Huygens' principle is illustrated in Fig.
1.14. We see that the wavefronts develop some curvature at the edges due to the radiation
from the end points being directed away from the axis. Succeeding wavefronts become more
and more curved so that the beam diverges.

A quantitative description of diffraction can be obtained by setting Huygens' principle
in a mathematical form known as the Fresnel-Kirchhoff formula (ref. 1.4).

In the detailed treatment of diffraction it is customary to distinguish between two general 
cases known as Fraunhofer and Fresnel diffraction. Qualitatively, Fraunhofer diffraction 
occurs when the incident and diffracted waves are effectively plane, while in Fresnel dif- 
fraction the curvature of the wavefront is significant. Clearly there is not a sharp distinction 
between the two cases. 

Of particular importance is the Fraunhofer diffraction produced by a narrow slit; the
arrangement for observing this is shown in Fig. 1.15(a). Over 80% of the light passing through
the aperture falls within the central maximum of the pattern, on either side of which there
is a series of low irradiance secondary maxima. These maxima are separated by minima
which occur at angles of diffraction $\theta$ given by



$$\sin\theta = \frac{p\lambda}{D}, \qquad p = \pm 1, \pm 2, \ldots \qquad (1.29)$$

where $D$ is the width of the slit. A rather more important case is that of the circular
aperture which produces a pattern consisting of a central bright area surrounded by concentric
dark and bright rings. About 84% of the light is concentrated within the central spot, which
is called the Airy disk. A measure of the amount of diffraction is given by the angle $\theta$
at which the first dark ring occurs. It may be shown, using the Fresnel-Kirchhoff formula, that
$\theta$ is given by

$$\sin\theta = 1.22\frac{\lambda}{D} \approx \frac{\lambda}{D} \qquad (1.30)$$

where $D$ is the diameter of the aperture. In many applications the output from a laser or other
source is focused, expanded, or otherwise passed through lenses, prisms, stops and the like.
In each case the edges of the optical component may serve as an aperture limiting the extent
of the beam and thereby introducing additional divergence due to diffraction.

FIG. 1.15 Schematic arrangement for observing the Fraunhofer diffraction produced by a single
slit or aperture (a) and the resultant irradiance distribution as a function of the angle of
diffraction $\theta$ (b).

In passing, it is noteworthy that diffraction sets a theoretical limit to the resolving power 
of optical instruments and it is also responsible for the occurrence of 'missing orders' in the 
interference patterns produced by double and multiple slit arrangements. The irradiance
distribution of such patterns has an envelope which is the single slit diffraction pattern shown 
in Fig. 1.15(b). Thus at certain angles 9 where an interference maximum is expected none 
appears because a diffraction minimum occurs at the same angle. 



EXAMPLE 1.3 Resolving power: the Rayleigh criterion



We may estimate (a) the minimum separation of two point sources that can just be
resolved by a telescope with an objective lens of 0.1 m diameter which is 500 m from
the sources and (b) the minimum wavelength difference which may be resolved by a
diffraction grating, which is 40 mm wide and has 600 lines mm$^{-1}$, in the first order.
(Assume $\lambda = 550$ nm in both cases.)

We define resolving power in terms of the Rayleigh criterion, which states that two
objects (or wavelengths) are just resolved if their diffraction patterns when viewed
through the optical system are such that the principal maximum of one falls in the first
minimum of the other.

In (a) this criterion therefore requires that the sources have an angular separation
$\theta$ given by $\sin\theta = \lambda/D$ (see eq. 1.29), that is $\theta \approx \lambda/D$.
Now $\theta$ is also given by $\theta = S_{min}/500$, where $S_{min}$ is the minimum separation
of the sources. Hence $\lambda/D = S_{min}/500$, whence

$$S_{min} = \frac{500 \times 550 \times 10^{-9}}{0.1} = 2.75\ \text{mm}$$

For (b) the chromatic resolving power $\lambda/\delta\lambda$ of a grating is given by
$\lambda/\delta\lambda = pN$, where $N$ is the number of lines used. Therefore, in the first
order $p = 1$, $\lambda/\delta\lambda = 1 \times 40 \times 600$; that is,
$\delta\lambda = 0.023$ nm.
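The arithmetic of Example 1.3 can be reproduced in a few lines. This is a sketch only; the function names are illustrative, and part (a) uses the small-angle form $\theta \approx \lambda/D$ adopted in the example.

```python
def min_separation(wavelength, aperture, distance):
    """Smallest resolvable separation of two sources, using theta = wavelength/aperture."""
    return distance * wavelength / aperture


def min_wavelength_difference(wavelength, order, number_of_lines):
    """Smallest resolvable wavelength difference of a grating, from lambda/dlambda = p*N."""
    return wavelength / (order * number_of_lines)


wavelength = 550e-9                                       # m
print(min_separation(wavelength, 0.1, 500.0))             # about 2.75e-3 m
print(min_wavelength_difference(wavelength, 1, 40 * 600)) # about 2.3e-11 m
```

Retaining the factor 1.22 of eq. (1.30) in part (a) would increase the minimum separation by the same factor.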
1.3 Optical components

Various optical components such as mirrors and lenses are widely used in optoelectronics.
For example, as we shall see in Chapter 5 the operation of lasers is crucially dependent on
an optical system comprising two plane or spherical mirrors, whilst lenses are often used to
collect, collimate and focus the light from diverging sources such as light-emitting diodes
(section 4.6). Mirrors, lenses and prisms are also used to manipulate and steer beams of
radiation within a variety of optical/optoelectronic systems.

In this section we present the various formulae dealing with the operation of such com- 
ponents, the appropriate derivations of which may be found in most elementary optics text- 
books (see e.g. ref. 1.1). 

1.3.1 The spherical mirror 

Figure 1.16 shows a ray from a source S on the axis of a spherical mirror at a distance $s$ from
its vertex V. The ray is redirected by the mirror to pass through the axis at the point T, a
distance $s'$ from V. Provided the rays are close to the axis (i.e. provided they are 'paraxial'
rays, ref. 1.5), then the distances $s$ and $s'$ are related by the equation

$$\frac{1}{s} + \frac{1}{s'} = \frac{2}{r} \qquad (1.31)$$



where r is the radius of curvature of the mirror. 

FIG. 1.16 Diagram showing the reflection of light from an object S on the axis of a spherical
mirror to form an image at T.

If the incident rays are parallel to the axis, that is $s = \infty$, then the image is formed at
the focal point of the mirror F, which is a distance $f$ from V. Thus from eq. (1.31), putting
$f = r/2$, we have

$$\frac{1}{s} + \frac{1}{s'} = \frac{1}{f} \qquad (1.32)$$



Similarly if the object is placed at F, the image is formed at infinity. Likewise if a small source 
of light is located at F, a parallel beam will be generated. 

In applying equations such as eq. (1.32) it is necessary to adopt a sign convention for the
distances involved. Of the several available a convenient one is to adopt the signs used in
coordinate geometry. Consequently distances measured from the vertex are positive if measured
to the right and negative to the left, and similarly distances measured upwards and
downwards from the axis are positive and negative respectively. Thus in eq. (1.32) $s$, $s'$ and
$f$ are all negative so the form of the equation remains unchanged.

The transverse magnification, $m$, of the mirror can be shown to be given by

$$m = -\frac{s'}{s} \qquad (1.33)$$

the minus sign indicating that the image is inverted (Fig. 1.17).
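The mirror equation and the sign convention can be exercised with a short sketch. The values below are assumptions chosen for illustration: a concave mirror facing left with r = -0.2 m (so f = -0.1 m) and an object 0.3 m to the left of the vertex. The function names are not from the text.

```python
import math


def image_distance(s, r):
    """Solve 1/s + 1/s' = 2/r for s', using coordinate-geometry signs."""
    inverse = 2.0 / r - 1.0 / s
    if inverse == 0.0:
        return math.inf            # object at the focal point: image at infinity
    return 1.0 / inverse


def magnification(s, s_prime):
    """Transverse magnification m = -s'/s."""
    return -s_prime / s


s, r = -0.3, -0.2
s_prime = image_distance(s, r)
print(s_prime, magnification(s, s_prime))
```

Here the image lies 0.15 m to the left of the vertex and the magnification is negative, so the image is inverted and half the size of the object. An object at infinity (`s = -math.inf`) gives an image at r/2, the focal point.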

FIG. 1.17 Diagram showing the formation of an image of an off-axis object, which can be used to
determine the magnification y'/y.

1.3.2 The thin spherical lens

FIG. 1.18 Schematic diagram showing the behaviour of a double convex lens.

Figure 1.18 shows the path of a ray which originates from a point at S in a medium of
refractive index $n_1$ as it passes through a simple spherical lens. The latter has two
spherical faces with radii $r_1$ and $r_2$ which are separated by a distance $d$ along the axis
and it is made from a material with a refractive index of $n_2$. The ray is refracted as it
passes through the two surfaces of the lens and subsequently crosses the axis at a point T. Thus
an object placed at S will give rise to an image at T. If the distances from the lens surfaces
to the points S and T are $s$ and $s'$ and the thickness of the lens is negligible compared with
$s$ and $s'$, then we have that



$$\frac{n_1}{s'} - \frac{n_1}{s} = (n_2 - n_1)\left(\frac{1}{r_1} - \frac{1}{r_2}\right) \qquad (1.34)$$



In most cases of practical importance the object and image are in air so that we may put
$n_1 = 1$. Writing $n_2 = n$ and noting that when $s' = \infty$, $s = f$, where $f$ is the focal
length (similarly if $s = \infty$, $s' = f'$), we have in general



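$$\frac{1}{s'} - \frac{1}{s} = \frac{1}{f'} = -\frac{1}{f} = (n - 1)\left(\frac{1}{r_1} - \frac{1}{r_2}\right) \qquad (1.35)$$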



where $f$ and $f'$, which are numerically equal, are often referred to as the first and second
focal lengths respectively. Frequently, however, we simply refer to the focal length of a lens
meaning the second focal length; the focal lengths of converging lenses such as that shown
in Fig. 1.19 are then positive, whilst those of diverging lenses are negative.

By considering the formation of the image of an object which extends beyond the axis
as shown in Fig. 1.19, the lateral magnification of the lens, $m$, may be shown to be given
by an equation which is identical to eq. (1.33):

$$m = -\frac{s'}{s}$$

As the object becomes larger, and/or as the diameter or aperture of the lens increases, some 
of the rays of light from the object to the lens will increasingly violate the 'paraxial' con- 
dition leading to a reduction in the quality of the image. Such effects are referred to as mono- 
chromatic aberration, the most familiar form of which is spherical aberration (ref. 1.6). 

Equation (1.35) shows that the focal length is dependent on the refractive index of the 
lens material, which in turn is a function of wavelength. Thus images formed by simple lenses 
often have a coloured boundary due to this chromatic aberration. 

It is worth noting that optical components inevitably have a finite aperture and can therefore
only collect a fraction of the wavefronts from the object. Consequently there will always
be an apparent deviation from rectilinear propagation when light passes through a lens, or
other aperture, so that the waves will be diffracted; the perfection of the image is thus said
to be diffraction limited.

FIG. 1.19 Diagram showing the formation of an image at T, by a biconvex lens, of an object at S,
which extends away from the optic axis. The diagram enables the lateral magnification m = y'/y
to be evaluated in terms of the object and image distances.

1.3.3 Other lenses 

In practice simple thin lenses seldom completely satisfy the requirements placed upon them,
and there is a wide range of compound lenses available, which comprise two or more lenses
either in contact or separated by small distances. Such lens 'systems' may correct for the
aberrations mentioned above, or yield very short focal lengths and/or very high magnifications.
Typical of these is the microscope objective lens which is often used in the laboratory
to focus light into very small apertures. Other lenses used for this purpose include the
gradient index or GRIN lens and sphere. Small spheres, some 2-5 mm in diameter, are easier
to manufacture than comparable lenses and are easier to align in experimental applications.
The focal point of a sphere lens lies just outside the sphere, and Fig. 1.20 shows the use
of two such spheres for the collection, collimation and refocusing of the light from a small
source.

GRIN lenses use materials in which the refractive index varies in some controlled way 
so that, by combining the refraction which occurs at the surface of the lens with that which 
occurs continuously as a function of the varying refractive index, an image can be produced. 
A typical GRIN lens is a cylindrical rod in which there is a radial variation of the refractive 
index, which is given by (ref. 1.7) 



$$n(r) = n_0\left(1 - \frac{A}{2}r^2\right) \qquad (1.36)$$



where $n_0$ is the refractive index on the lens axis, and $A$ is a positive constant. As a
result of the refractive index variation, rays of light incident on the end of the cylindrical
lens follow sinusoidal paths as illustrated in Fig. 1.21, with a period or pitch $P$ of the path
given by $P = 2\pi/\sqrt{A}$, while the maximum acceptance angle $\theta$ of the lens is given by

$$\sin\theta = n_0 R\sqrt{A} \qquad (1.37)$$

where $R$ is the lens radius. The characteristics of a GRIN lens in terms of the relationship
between the image and object distances are determined by the length of the lens relative to
the pitch. The quarter-pitch (0.25P) lens produces a perfectly collimated output beam when
the light emanates from a point source on the opposite face of the lens. Conversely the lens
focuses a collimated beam to a point at the centre of the opposite face (Fig. 1.22a); that is,
the focal point of the quarter-pitch GRIN lens is coincident with the lens faces. On the other
hand Figs 1.22(b) and (c) show GRIN lenses which are either just shorter (0.23P) or just
longer (0.29P) than a quarter pitch. In these cases the focal points lie just outside the lens
and are therefore easily accessible in terms of locating a source of light or an image. Figure
1.22(c) illustrates how light from a small source can be coupled into a small aperture, when
both are located close to the ends of the lens. The dimensions of such lenses are typically
2.0 mm in diameter by 6 mm in length, while the acceptance angle is typically 20°-28°.

FIG. 1.20 Collection, collimation and refocusing of light using two sphere lenses.

FIG. 1.21 Schematic diagram showing the path of a ray of light through a GRIN lens.

FIG. 1.22 Operation of typical GRIN lenses: (a) the quarter-pitch lens; (b) the 0.23P lens, used
to collect the light diverging from a small source and collimate it into a parallel beam; (c) the
0.29P lens used, for example, to couple the output from a source into a small aperture.
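The pitch and acceptance-angle relations can be tried out with plausible numbers. In the sketch below every parameter value is an assumption made for illustration: an on-axis index of 1.6, a radius of 1.0 mm and a gradient constant chosen so that a quarter pitch equals 6 mm. The function names are hypothetical.

```python
import math


def pitch(a):
    """Period P = 2*pi/sqrt(A) of the sinusoidal ray path."""
    return 2.0 * math.pi / math.sqrt(a)


def acceptance_angle_deg(n0, radius, a):
    """Maximum acceptance angle from sin(theta) = n0 * R * sqrt(A), in degrees."""
    return math.degrees(math.asin(n0 * radius * math.sqrt(a)))


A = (2.0 * math.pi / 24.0e-3) ** 2   # m^-2, assumed so that P = 24 mm
N0 = 1.6                             # assumed on-axis refractive index
RADIUS = 1.0e-3                      # m, assumed lens radius

print(pitch(A))                              # pitch in metres
print(0.25 * pitch(A))                       # quarter-pitch length in metres
print(acceptance_angle_deg(N0, RADIUS, A))   # degrees
```

With these assumed values the acceptance angle comes out near 25°, inside the typical range quoted above, which suggests the numbers are at least self-consistent.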



1.4 Light sources - blackbody radiation

The sources discussed in this section are the so-called classical or thermal sources. These
are so named because they radiate electromagnetic energy in direct relation to their
temperature. Thermal sources can be divided into two classes, namely blackbody radiators and
line sources. The former are opaque bodies or hot, dense gases which radiate at virtually all
wavelengths. Line sources, on the other hand, radiate at discrete wavelengths.

1.4.1 Blackbody sources 

The radiation from opaque objects and dense gases was widely studied in the late nineteenth
century resulting in the formulation of the following empirical laws. First, it was found that
the optical power radiated from a body was proportional to the fourth power of the absolute
temperature, $T$, and could be written as

$$W = \varepsilon\sigma AT^4 \qquad (1.38)$$

where $A$ is the area of the body and $\sigma$ the Stefan-Boltzmann constant. The parameter
$\varepsilon$ is called the emissivity; its value lies between zero and unity depending on the
nature of the surface of the emitting body. The 'ideal' emitter has an emissivity of unity and
is known as a blackbody. The name arises because it can be shown that a body whose emissivity is
unity will completely absorb any radiation falling upon it; hence it will appear 'black'. An
approach to an ideal blackbody emitter can be made by piercing a small hole in an otherwise
closed cavity, and then, if the cavity is maintained at a uniform, constant temperature, the
radiation leaving the hole is essentially that of a blackbody. Most hot surfaces only approach
the ideal and consequently have emissivity values of less than unity.
EXAMPLE 1.4 Radiated power



We may estimate the total power radiated from a source of area $10^{-5}$ m$^2$ at a
temperature of 2000 K, given that the emissivity of the surface is 0.7.
From eq. (1.38) we have

$$W = 0.7 \times 5.67 \times 10^{-8} \times 10^{-5} \times (2000)^4 = 6.35\ \text{W}$$
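Example 1.4 is easily reproduced, and the same function shows how strongly the power depends on temperature. A minimal sketch; the function name is illustrative and the Stefan-Boltzmann constant is taken as the value used in the example.

```python
SIGMA = 5.67e-8   # Stefan-Boltzmann constant in W m^-2 K^-4, as used in Example 1.4


def radiated_power(emissivity, area, temperature):
    """Total radiated power W = emissivity * sigma * A * T**4, eq. (1.38)."""
    return emissivity * SIGMA * area * temperature ** 4


print(radiated_power(0.7, 1.0e-5, 2000.0))   # about 6.35 W
print(radiated_power(0.7, 1.0e-5, 4000.0))   # doubling T gives 16 times the power
```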



Secondly it was noted that the spectral distribution of the energy emitted at a given
temperature has a definite maximum and that this maximum shifts to shorter wavelengths as
the temperature increases, as illustrated in Fig. 1.23. This shift is given by Wien's
displacement law, which can be expressed as

$$\lambda_m T = \text{constant} \qquad (1.39)$$

where $\lambda_m$ is the wavelength at which the radiated power is a maximum for a given
temperature $T$.

FIG. 1.23 Distribution of energy in the spectrum of a blackbody radiator at various temperatures
(horizontal axis: wavelength in nm).

The spectral distribution of blackbody radiation can be described in terms of the spectral
radiant emittance function, $W(\lambda,T)$, where the power radiated from a blackbody per unit
area between the wavelengths $\lambda$ and $\lambda + d\lambda$ is given by
$W(\lambda,T)\,d\lambda$. Planck derived the following functional form for $W(\lambda,T)$:

$$W(\lambda,T) = \frac{2\pi hc^2}{\lambda^5}\left(\frac{1}{\exp(hc/\lambda kT) - 1}\right) \qquad (1.40)$$

Figure 1.23 shows the variation in $W(\lambda,T)$ with wavelength at a number of different
temperatures.
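Wien's displacement law can be recovered numerically from Planck's formula by locating the maximum of $W(\lambda,T)$. The sketch below uses a simple 1 nm scan rather than any analytic method; the constants are rounded values and the function names are illustrative.

```python
import math

H = 6.626e-34     # Planck constant, J s
C = 2.998e8       # speed of light, m/s
K_B = 1.381e-23   # Boltzmann constant, J/K


def spectral_emittance(wavelength, temperature):
    """Planck spectral radiant emittance W(lambda, T), eq. (1.40)."""
    x = H * C / (wavelength * K_B * temperature)
    return 2.0 * math.pi * H * C ** 2 / (wavelength ** 5 * math.expm1(x))


def peak_wavelength(temperature, start=200e-9, stop=10e-6, step=1e-9):
    """Wavelength of maximum emittance found by scanning in steps of `step`."""
    best_wavelength, best_value = start, 0.0
    for i in range(int(round((stop - start) / step)) + 1):
        wavelength = start + i * step
        value = spectral_emittance(wavelength, temperature)
        if value > best_value:
            best_wavelength, best_value = wavelength, value
    return best_wavelength


for temperature in (2000.0, 4000.0):
    peak = peak_wavelength(temperature)
    print(temperature, peak, peak * temperature)
```

The product $\lambda_m T$ is the same at both temperatures to within the scan resolution, close to $2.9 \times 10^{-3}$ m K.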

Sometimes it is useful to consider the radiation distribution as a function of frequency
rather than wavelength, when we have

$$W(\nu,T) = \frac{2\pi h\nu^3}{c^2}\left(\frac{1}{\exp(h\nu/kT) - 1}\right) \qquad (1.41)$$

We have seen that a close approximation to a blackbody can be constructed by piercing a
hole in an otherwise closed cavity: the smaller the hole, the nearer the emitted radiation
approaches that of a blackbody. This naturally leads to a consideration of the radiation within
a totally sealed container. If $\rho(\nu,T)\,d\nu$ denotes the energy density of such radiation
between the frequencies of $\nu$ and $\nu + d\nu$ where the cavity is at a temperature $T$, then
it may be shown (see Problem 1.11) that

$$\rho(\nu,T) = \frac{4}{c}W(\nu,T)$$

Substituting for $W(\nu,T)$ from eq. (1.41) gives another form of Planck's equation which will
be found useful in Chapter 5, that is

$$\rho(\nu,T) = \frac{8\pi h\nu^3}{c^3}\left(\frac{1}{\exp(h\nu/kT) - 1}\right) \qquad (1.42)$$

In the derivation of this equation, Planck considered the possible standing wave patterns or
modes which can exist within a cavity and assumed that the energy associated with each
mode was quantized; that is, the energy could only exist in integral multiples of some lowest
amount or quantum. Thus, according to Planck, matter could only emit discrete quantities
of radiation which were called photons. The success of this assumption, as we mentioned
in the introduction to this chapter, provided the foundation for the development of modern
quantum theory.



1.4.2 Line sources 

In the case of excited gases in which there is little interaction between the individual atoms,
ions or molecules, the electromagnetic radiation is emitted at well-defined wavelengths. This
can be understood quite easily on the basis of the simple Bohr model of the atom in which
it is considered that the atom consists of a positive nucleus of charge $Ze$ ($Z$ being the
atomic number of the atom) with electrons of mass $m$ and charge $e$ in certain 'allowed' bound
orbits around it. Each of these orbits corresponds to a well-defined energy level. The energy is
given by

$$E_n = -\frac{mZ^2e^4}{8n^2h^2\varepsilon_0^2} \qquad (1.43)$$

where $n$ is an integer known as the principal quantum number (see e.g. ref. 2.1b). The
outermost electron may be excited from its normal or ground state orbit to higher energy orbits
which are normally unoccupied. When an electron undergoes a transition from one of these
excited orbits (or energy levels) to a lower orbit it emits a quantum of radiation. The energy
of the quantum is just the difference $\Delta E$ between the energies of the initial and final
orbits. Thus the quantum energy is

$$h\nu = \frac{hc}{\lambda} = \Delta E \qquad (1.44)$$

from which, using eq. (1.43), we see that

$$\nu = \frac{\Delta E}{h} = \frac{mZ^2e^4}{8h^3\varepsilon_0^2}\left(\frac{1}{n_f^2} - \frac{1}{n_i^2}\right)$$

$n_f$ and $n_i$ being the values of the principal quantum number corresponding to the final and
initial orbits (or energy levels) involved in the transition.

The spectral lines emitted in this way can have a very narrow frequency spread; that is,
they are very nearly monochromatic. In practice, however, there are a number of causes of
spectral broadening which increase the spread of the wavelength (or frequency) associated
with the emitted photons (see section 5.7).

As long as the atoms are in thermal equilibrium with their surroundings, the energy radiated
by an intense line radiator can never exceed that of a blackbody at the same temperature
as the line source. This is true despite the very different wavelength distributions of the
emitted energy of the two sources, even when comparing the energies they emit per unit
wavelength range (or their spectral radiant emittances). This rule is broken in the case of
lasers, where, as we shall see in Chapter 5, the atoms are not in thermal equilibrium.

EXAMPLE 1.5 Ionization energy of the hydrogen atom 



Here we calculate the ionization energy of the hydrogen atom (Z = 1) given the physical 
constants in Appendix 6. The ionization energy is the energy required to excite 
the electron from the ground state (n = 1) to infinity (n = ∞). 
Thus from eq. (1.43) we have 

$$E_{\rm ion} = \frac{9.1 \times 10^{-31} \times (1.6 \times 10^{-19})^4}{8 \times (6.6 \times 10^{-34} \times 8.85 \times 10^{-12})^2} = 2.176 \times 10^{-18}\ \text{J or } 13.6\ \text{eV}$$
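The arithmetic of Example 1.5 can be reproduced in a few lines. This is a sketch only: the function name `ionization_energy` is hypothetical, and the second set of constants is an assumed set of more precise values, not the book's Appendix 6. The last digits of the result depend on how the constants are rounded, but both sets give about 13.6 eV.

```python
def ionization_energy(m, e, h, eps0, z=1):
    """Energy in joules needed to take the electron from n = 1 to n = infinity.

    This is the magnitude of eq. (1.43) evaluated at n = 1.
    """
    return m * z**2 * e**4 / (8 * h**2 * eps0**2)


# Rounded constants as they appear in Example 1.5.
e_ion_rounded = ionization_energy(9.1e-31, 1.6e-19, 6.6e-34, 8.85e-12)

# Slightly more precise constants (assumed values).
e_ion = ionization_energy(9.109e-31, 1.602e-19, 6.626e-34, 8.854e-12)

for label, joules in (("rounded", e_ion_rounded), ("precise", e_ion)):
    print(f"{label}: {joules:.3e} J = {joules / 1.602e-19:.2f} eV")
```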
In concluding this section we return briefly to the photoelectric effect, which dramatically 
illustrates the photon or particle nature of light. Einstein explained the emission of electrons 
from metal surfaces exposed to light as being due to a transfer of energy from a single photon 
to a single electron. It was found that (a) the energy of the emitted electrons depends not on 
the irradiance of the incident light but rather on its frequency, (b) for light of a given 
frequency the photoelectrons have a maximum kinetic energy $E_{\max}$ (= $\tfrac{1}{2} m v_{\max}^2$) and (c) for a 
given metal there is a minimum, or threshold, frequency $\nu_0$ for the light which will cause 
electrons to be emitted. These observations are summarized in Einstein's photoelectric 
equation 

$$E_{\max} = h\nu - e\phi \qquad (1.45)$$

where e is the electron charge and φ is a constant for a particular metal known as the work 
function. The quantity eφ represents the energy required to free an electron from the surface 
(section 2.6). The difference between the incident photon energy hν and eφ then appears as 
the kinetic energy of the emitted electron. 



EXAMPLE 1.6 Work function of metals 



Given that the lowest frequency of light which will eject electrons from a tungsten 
surface is 1.1 × 10¹⁵ Hz, we may calculate its work function. 

From eq. (1.45) we have $h\nu_0 = e\phi$, where $\nu_0$ is the threshold frequency, and we 
assume that the ejected electrons have zero kinetic energy. 

Hence φ = 4.5 eV. 
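Example 1.6 and eq. (1.45) can be sketched in code as follows. The constants are assumed rounded values, the function names are hypothetical, and the 200 nm wavelength in the usage lines is an illustrative choice that does not come from the text.

```python
H = 6.626e-34         # Planck constant, J s
E_CHARGE = 1.602e-19  # electron charge, C
C = 2.998e8           # speed of light, m/s


def work_function_ev(threshold_hz):
    """Work function in eV from the threshold frequency (h * nu_0 = e * phi)."""
    return H * threshold_hz / E_CHARGE


def max_kinetic_energy_ev(wavelength_m, phi_ev):
    """Maximum photoelectron energy in eV from eq. (1.45); zero below threshold."""
    photon_ev = H * C / (wavelength_m * E_CHARGE)
    return max(photon_ev - phi_ev, 0.0)


phi = work_function_ev(1.1e15)           # tungsten threshold from Example 1.6
ke = max_kinetic_energy_ev(200e-9, phi)  # hypothetical 200 nm illumination
print(f"phi = {phi:.2f} eV, E_max at 200 nm = {ke:.2f} eV")
```

Clamping the result at zero reflects observation (c) in the text: below the threshold frequency no electrons are emitted at all.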



1.5 Units of light 

The measurement of the energy of electromagnetic radiation when all wavelengths are treated 
equally is known as radiometry. The measurement of those aspects of radiation which affect 
vision is referred to as photometry. The link between these is the standard luminosity curve 
shown in Fig. 1.24, which shows the spectral response $V_\lambda$ of the average eye to light of 
different wavelengths. The value of $V_\lambda$ (often called the relative luminous efficiency) is taken 
as unity at λ = 555 nm where the eye has its maximum sensitivity. The value of $V_\lambda$ falls to 
near zero at the extremes of the visible spectrum; that is, at about 400 nm and 700 nm. For 
normal photopic vision (when the eye is adapted for high levels of stimulus) at the peak 
sensitivity of the eye (555 nm) 1 watt of radiant power equals 680 lumens by definition. The 
watt is a radiometric unit whereas the lumen is a photometric unit. At any other wavelength 
this conversion is scaled by the value of the relative luminous efficiency at that wavelength. 
To obtain a measure of the relative brightness of various sources we must use photometric 
units. In laser technology and safety, however, radiometric units are widely used. We shall 
therefore briefly describe the most important radiometric units and give their photometric 
equivalents in Table 1.2. 
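The watt-to-lumen conversion described above amounts to a single multiplication. The sketch below assumes the 680 lm per watt figure quoted in the text and takes the relative luminous efficiency as an argument, since its value has to be read from the luminosity curve of Fig. 1.24; the function name and the value 0.5 used in the second call are purely illustrative.

```python
LUMENS_PER_WATT_AT_PEAK = 680.0  # lm/W at 555 nm, the value quoted in the text


def luminous_flux(radiant_power_w, v_lambda):
    """Luminous flux in lumens for monochromatic light.

    v_lambda is the relative luminous efficiency (0 to 1) at the wavelength
    of interest, read from the standard luminosity curve.
    """
    if not 0.0 <= v_lambda <= 1.0:
        raise ValueError("relative luminous efficiency must lie between 0 and 1")
    return LUMENS_PER_WATT_AT_PEAK * v_lambda * radiant_power_w


print(luminous_flux(1e-3, 1.0))  # 1 mW at 555 nm, where V = 1
print(luminous_flux(1e-3, 0.5))  # 1 mW where V = 0.5 (hypothetical value)
```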
FIG. 1.24 Relative luminous efficiency curve for normal photopic vision (when the eye is adapted 
for high levels of stimulus). 
TABLE 1.2 Radiometric and photometric units (SI units) 

Symbol | Radiometric term and units | Photometric term and units | Definition 
Q | Radiant energy (J) | Luminous energy (talbot) | 
Φ | Radiant power or flux (W) | Luminous power or flux (lm) | 
I (E) | Irradiance (W m⁻²) | Illuminance (lm m⁻²) | Total power falling on unit area 
J | Radiant intensity (W sr⁻¹) | Luminous intensity (lm sr⁻¹) | Power radiated by a point source into unit solid angle 
L | Radiance (W m⁻² sr⁻¹) | Luminance or brightness (lm m⁻² sr⁻¹) | Radiant/luminous intensity per unit projected area in a given direction 
W | Radiant emittance (W m⁻²) | | Total power radiated in all directions from unit area 



Note: Any of these quantities can be expressed per unit frequency or wavelength. In these cases the word 
'spectral' is added as a prefix to that term and a ν or λ is added as a subscript to the symbol. Thus, for example, 
the spectral radiance $L_\lambda$ is the radiance divided by the bandwidth in wavelength units (µm, nm, etc.). The 
spectral radiance may then have units of W m⁻² sr⁻¹ nm⁻¹ or, if the bandwidth is given in frequency units, the 
spectral radiance would have units W m⁻² sr⁻¹ Hz⁻¹. 



Of course, light is a form of energy, and radiant energy and power are measured in joules 
and watts respectively. Radiant power is sometimes referred to as flux Φ; the flux per unit 
area delivered to a surface is the irradiance I. It should be noted at this point that many texts 
on optics and lasers call the power per unit area the intensity. As we mentioned in footnote 
1, however, this practice should not be encouraged and accordingly in this text we shall use 
irradiance. The recommended symbol for irradiance is E (or $E_e$) but to avoid confusion with 
E used for energy we shall use the symbol I. 

The energy emitted from a point source is described in terms of the radiant intensity $J_e$, 
which is the radiant flux emitted by a point source within a unit solid angle in a given 
direction. $J_e$ is therefore measured in watts per steradian (W sr⁻¹). 

For small plane sources, the equivalent term to radiant intensity is the radiance $L_e$, which 
is the power radiated into unit solid angle, in a given direction per unit projected area of 
source: that is, $L_e$ has units W m⁻² sr⁻¹. Several laser texts refer to this quantity as the 
brightness, which is possibly the most frequently used and also the most confusing term in this 
subject. The photometric quantity corresponding to irradiance is illuminance, defined as the 
flux delivered to a surface per unit area. The visual effect of this energy is characterized by 
the luminance or brightness. Brightness is also often used to describe psychological per- 
ception. The safest procedure would appear to be to take careful note of the units of the terms 
used in a given discussion. 

The more commonly used radiometric quantities and their photometric (or luminous) 
equivalents are shown in Table 1.2. To reduce the profusion of nomenclature which existed 
in the recent past, the same symbols are used for corresponding radiometric and photomet- 
ric quantities. When it is necessary to distinguish between radiant and luminous quantities 
the subscripts e (for energy) and v (for visual) are used. 

The photometric quantity luminous intensity $J_v$ is one of the seven quantities chosen as 
dimensionally independent base quantities in the SI system of units. The units of luminous 
intensity, namely lumens per steradian (lm sr⁻¹), are often referred to as candela (cd). The 
luminance, therefore, has units of cd m⁻² or, more commonly in practice, cd cm⁻² despite 
the centimetre not being a recognized SI unit. Quite often Imperial units are used and the 
situation is further complicated depending on whether the emitting or reflecting surface is 
a Lambertian surface or a uniform diffuser - see section 4.8 and ref. 1.8. 



NOTES 



1. In the past the word intensity has been used for the flow of energy per unit time per unit 
area. However, by international agreement that term is being replaced by irradiance. 

2. In the literature on optics one meets the terms reflectivity, reflection coefficient and 
reflectance. Unfortunately, these terms are often defined in different ways. For 
example, reflectance is often used for the ratio of the reflected energy per unit area 
(or flux) to the incident flux and also for the ratio of the reflected to the incident ampli- 
tude. We shall adopt the practice suggested by the Symbols Committee of the Royal 
Society (1971 and 1975) and use reflectance for the ratio of the reflected to the inci- 
dent flux. 

3. Here, and elsewhere in the literature, for simplicity the irradiance is taken to be the square 
of the amplitude of the electric vector. In reality, the irradiance is given by the Poynting 
vector $\mathbf{S} = \boldsymbol{\mathcal{E}} \times \boldsymbol{\mathcal{H}}$. For example, the time average of the Poynting vector for a plane wave 
is the quantity to which our eyes or a detector would be sensitive and is given 
by $\langle S \rangle = \tfrac{1}{2}\varepsilon_0 c\,\mathcal{E}_0^2$, where $\mathcal{E}_0$ is the maximum amplitude of the electric field oscillations. 
Thus $\langle S \rangle = 1.33 \times 10^{-3}\,\mathcal{E}_0^2$ W m⁻² in free space (see e.g. ref. 1.1c). 






4. The recommended symbol for radiant intensity is /, but we intend to use that symbol for 
irradiance; the radiant intensity is not frequently used in the text. 



PROBLEMS 



1.1 Express eq. (1.6) in alternative forms such as $\mathcal{E} = \mathcal{E}_0 \cos[2\pi(t/T - x/\lambda)] = \mathcal{E}_0 \cos[k(vt - x)]$. 
Write down the equation of the wave which is (a) 90° out of phase 
with these equations and (b) travelling in the negative x direction. 

1.2 By considering the superposition of two waves which differ in angular frequency and 
wavenumber by the small amounts δω and δk, derive eq. (1.8). 

1.3 Given that the average solar constant is of the order of 700 W m⁻² in the United 
Kingdom, calculate the corresponding electric field amplitude. 

1.4 Derive Malus's law, eq. (1.15). 

1.5 Derive the results of eq. (1.17) for the superposition of two waves by (a) treating the 
two waves as vectors of magnitudes proportional to the wave amplitude and direction 
given by the phase of the wave and (b) expressing the waves as complex exponential 
functions. Show that the energy of the resultant of adding five waves of equal 
amplitudes $\mathcal{E}_0$ and phase constants 0, π/4, π/2, 3π/4, π is $\mathcal{E}_0^2(3 + 2\sqrt{2})$. Show also that 
the resultant energy of superposing an infinite series of waves of amplitudes $\mathcal{E}_0$, $\mathcal{E}_0/2$, 
$\mathcal{E}_0/4$, $\mathcal{E}_0/8$, ... and phase constants 0, π/2, π, 3π/2, ... is $4\mathcal{E}_0^2/5$. 

1.6 Show that the fringe spacing in a Young's slits experiment is given by δy = λx/H. If the 
aperture to screen distance is 1.5 m and the wavelength is 632.8 nm, what slit 
separation is required to give a fringe spacing of 1.2 mm? If a glass plate (n = 1.5) of 0.05 mm 
thickness is placed over one slit, what is the lateral displacement of the fringe system? 

1.7 Verify eq. (1.25) for interference effects in thin films: explain the coloured appear- 
ance of thin films of oil on wet surfaces or of soap bubbles. 

In an experiment to observe fringes of equal inclination, the source emits 
wavelengths of 500 nm and 600 nm. It is observed that a bright fringe is obtained for both 
wavelengths when the light passes through the film at cos⁻¹ 0.7 to the normal, and that 
the next such coincidence occurs at an angle cos⁻¹ 0.8. Deduce the optical thickness 
of the film and the orders of the fringes for which the two coincidences occur. 

1.8 Explain the phenomenon of 'blooming'. Calculate the thickness of a 'blooming' layer 
of refractive index 2.0 deposited onto a glass substrate of refractive index 1.5 which 
will give (a) maximum and (b) minimum reflected light. Assume normal incidence 
and that λ = 500 nm. 

1.9 What is the maximum number of orders that can be observed using a plane grating of 
300 lines mm⁻¹ for normally incident light of wavelength 546 nm? If light of all 
wavelengths from 400 to 700 nm were used, what wavelengths would be superposed on 
the 546 nm wavelength in the highest of these orders? 

1.10 A collimated beam of light from an He-Ne laser (λ = 632.8 nm) falls normally onto 
a circular aperture of 0.5 mm diameter. A lens of 0.5 m focal length placed just behind 
the aperture focuses the diffracted light onto a screen. Calculate the distance of the 
first dark ring from the centre of the diffraction pattern. Using the same arrangement, 
what is the minimum separation of another source which could just be resolved if the 
two sources are 10 m from the aperture? 

1.11 Show that the functions which describe the energy density within an enclosed cavity, 
ρ(ν, T), and the power emitted by a blackbody of unit area, W(ν, T), are related by the 
equation 

$$W(\nu, T) = \frac{c}{4}\,\rho(\nu, T)$$

(Hint: consider a small hole made in the side of an otherwise enclosed cavity and 
determine by integration over a hemisphere of radius c within the cavity how much of the 
radiation which is inside the cavity will pass through the hole in 1 second.) 

1.12 Calculate the difference in energy between the Bohr orbits for hydrogen (Z = 1) for 
which n = 4 and n = 2. What is the wavelength associated with the photon emitted by 
an electron which undergoes a transition between these levels? 

1.13 Calculate the minimum frequency of light which will cause the photoemission of elec- 
trons from a metal of 2.4 eV work function. What is the maximum kinetic energy of 
the photoelectrons emitted by light of 300 nm wavelength? 
Elements of solid state physics 



To understand the operation of many of the optoelectronic devices discussed in later chap- 
ters, we need at least an appreciation of the solid state physics of homogeneous materials 
and of the junctions between different materials. Accordingly, we now examine the mech- 
anisms by which current flows in a solid, why some materials are good conductors of elec- 
tricity and others are poor conductors, and why the conductivity of a semiconductor varies 
with temperature, with the concentration of impurities it contains, and with exposure to light. 

One of the physical models which has been quite successful in explaining these and related 
phenomena is the energy band model of solids. Before describing this, however, we shall 
review some relevant concepts from quantum physics. As in section 1.4 it is assumed that 
the reader is familiar with much of the basic physics of isolated atoms including the descrip- 
tion of atomic energy states in terms of four quantum numbers and the application of the 
Pauli exclusion principle leading to the electron configuration of atoms (ref. 2.1). 

Because of space limitations, several of the equations presented in later sections will not 
be derived in detail. The reader will be guided through their derivation, however, via the 
problems and the references presented at the end of the chapter. 



2.1 Review of some quantum mechanical concepts 

In Chapter 1 we saw that light displays a dual nature, that of particle and wave. Now the 
energy of a light particle, the photon, can be written as E = hv, where v is the frequency of 
the wave associated with the photon. As the rest mass of the photon is zero, its momentum 
p can be written as 

p = E/c 

Therefore 

$$p = h\nu/c = h/\lambda \qquad (2.1)$$

where λ is the wavelength associated with the photon. This equation, known as the de Broglie 
relation, can be applied quite generally to particles such as electrons, neutrons and atoms, 
which also exhibit a dual nature as demonstrated, for example, by electron and neutron 
diffraction. 
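To give a feel for the magnitudes involved in eq. (2.1), the sketch below evaluates the de Broglie wavelength of an electron accelerated from rest through a potential difference. It assumes non-relativistic motion, so that p = (2meV)^(1/2), and uses rounded constants; the function names and the 100 V figure are illustrative choices, not taken from the text.

```python
import math

H = 6.626e-34         # Planck constant, J s
M_E = 9.109e-31       # electron mass, kg
E_CHARGE = 1.602e-19  # electron charge, C


def photon_momentum(wavelength_m):
    """Momentum of a photon from eq. (2.1), p = h / lambda."""
    return H / wavelength_m


def electron_wavelength(volts):
    """De Broglie wavelength (m) of an electron accelerated through `volts`.

    Non-relativistic assumption: kinetic energy e * V = p**2 / (2 * m).
    """
    momentum = math.sqrt(2 * M_E * E_CHARGE * volts)
    return H / momentum


lam = electron_wavelength(100.0)
print(f"electron at 100 V: lambda = {lam * 1e9:.4f} nm")
```

The result is of the order of 0.1 nm, comparable with interatomic spacings in crystals, which is why electron diffraction by crystals can be observed.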

The question as to what is meant by the wave to be associated with a particle immediately 
arises. In 1926, Born showed that the wave amplitude is related to the probability of 
locating the particle in a given region of space. More specifically, in quantum mechanical 
problems, as we shall see below, we attempt to find a quantity Ψ called the wavefunction, 
which while having no direct physical meaning itself, is defined in such a way that the 
probability of finding the particle in the region of space between x and x + dx, y and y + dy, z and 
z + dz is given by Ψ*Ψ dx dy dz, where Ψ* is the complex conjugate of Ψ (Ψ may be 
complex, i.e. involve the square root of -1). Clearly if the particle exists then 

$$\int_{-\infty}^{\infty} \Psi^*\Psi \, dx\, dy\, dz = 1 \qquad (2.2)$$

Ψ*Ψ or |Ψ|² is called the probability density function. 

A real particle, which is localized, cannot be described by a wave equation of the form 
of eq. (1.6) say, which is of infinite extent. Rather, the wave description of a particle is given 
in terms of a wave packet (see section 1.2), which is the sum of individual waves with varying 
amplitudes and frequencies. These waves interfere destructively except for a certain region 
in space where the probability of finding the particle is high (Fig. 2.1). The speed of the 
particle v is the same as the group velocity of the wave packet, that is from eq. (1.8) 

$$v = v_g = \frac{d\omega}{dk}$$

FIG. 2.1 Wave packet of length Δx. 

The probability of finding the particle is greatest at the centre of the wave packet, x = 0 
in Fig. 2.1; however, there is a smaller but finite probability of finding it anywhere in the 
region Δx. Now it may be shown from the Fourier integral (ref. 2.2) that the narrower is Δx, 
that is the more accurately the position of the particle is known, the wider must be the range 
of wavelengths Δλ comprising the wave packet. From eq. (2.1) this implies a greater 
uncertainty Δp in the momentum of the particle. Heisenberg showed that in any simultaneous 
measurement of the position and momentum of a particle, the uncertainties in the two 
measured quantities are related by an equation of the form 

$$\Delta x\,\Delta p \geq \hbar/2 \qquad (2.3)$$

where ħ = h/2π. Equation (2.3) is called Heisenberg's uncertainty principle. This implies that it is 
impossible to describe with absolute precision events involving individual particles: we can only 
talk of the probability of finding a particle at a certain position at a given instant of time. 

2.1.1 Schrodinger equation 

In general, the Schrödinger equation includes both space and time dependencies and is of 
the form 

$$-\frac{\hbar^2}{2m}\left(\frac{\partial^2\Psi}{\partial x^2} + \frac{\partial^2\Psi}{\partial y^2} + \frac{\partial^2\Psi}{\partial z^2}\right) + V\Psi = i\hbar\,\frac{\partial\Psi}{\partial t} \qquad (2.4)$$

where $i = \sqrt{-1}$, V is the potential energy of the particle and Ψ the wavefunction, both of 
which may depend on position and time. That is, we have Ψ = Ψ(x, y, z, t) and V = V(x, y, z, t). 
Equation (2.4) is often referred to as the time-dependent Schrödinger equation. Unfortunately, 
eq. (2.4) is quite complicated and can only be solved in a limited number of cases. For our 
purposes, it is sufficient to consider situations in which the potential energy of the particle 
does not depend explicitly upon time. The forces that act upon it, and hence V, then vary 
only with the position of the particle. In this case, eq. (2.4) reduces to the time-independent 
form, which, in one dimension, may be written as 

$$\frac{d^2\psi(x)}{dx^2} + \frac{2m}{\hbar^2}\,[E - V(x)]\,\psi(x) = 0 \qquad (2.4a)$$

where ψ is the time-independent wavefunction and E is the total energy of the particle. 



2.1.1.1 Potential well 

It is quite difficult to find solutions of the Schrödinger equation for most realistic potential 
distributions. However, there are several physical situations, for example a free electron 
trapped in a metal or charge carriers trapped by the potential barriers of a double 
heterojunction (section 5.10.2.3), which can be approximated by an electron in an infinitely deep, 
one-dimensional potential well. The situation is illustrated in Fig. 2.2, where for simplicity 
we have taken V(x) = 0 except at the boundaries where V(x) is infinitely large. That is, the 
boundary conditions are 

$$V(x) = 0 \qquad 0 < x < L \qquad (2.5)$$

and 

$$V(x) = \infty \qquad x \leq 0,\ x \geq L \qquad (2.5a)$$
FIG. 2.2 One-dimensional potential well of infinite depth. 

Inside the potential well, eq. (2.4a) becomes 

$$\frac{d^2\psi(x)}{dx^2} + \frac{2m}{\hbar^2}\,E\,\psi(x) = 0 \qquad (2.6)$$

which is the wave equation for a free particle in a region where V(x) = 0. 
A possible solution to eq. (2.6) is 

$$\psi(x) = A \sin kx + B \cos kx \qquad (2.7)$$

where A and B are constants and 

$$k^2 = 2mE/\hbar^2 \qquad (2.8)$$

Applying the boundary conditions given by eqs (2.5) we see that ψ(x) must be zero at the 
boundaries of the well. Otherwise, there would be a non-zero value of |ψ|² outside the well, 
which is impossible because a particle cannot penetrate an infinitely high potential barrier. 
Thus as ψ(x) = 0 at x = 0, B must be zero, and as ψ(x) = 0 at x = L, k must be defined so that 
sin kx is zero at x = L. That is, kL must be an integral multiple of π. We can therefore write 

$$\psi(x) = A \sin kx \qquad (2.9)$$

where 

$$k = n\pi/L \qquad n = 1, 2, 3, \ldots \qquad (2.10)$$

Substituting for k from eq. (2.8) we have 

$$\left(\frac{2mE}{\hbar^2}\right)^{1/2} = \frac{n\pi}{L}$$

or, since ħ = h/2π, 

$$E_n = \frac{n^2 h^2}{8mL^2} \qquad (2.11)$$

Thus for each value of n the particle energy is described by eq. (2.11). We note that the total 
energy is quantized; n is a quantum number. 
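As a numerical illustration of eq. (2.11), the sketch below evaluates the first few energy levels of an electron in an infinitely deep one-dimensional well. The well width of 1 nm is a hypothetical value chosen only for illustration, the constants are assumed rounded values, and `well_energy` is an illustrative name.

```python
H = 6.626e-34    # Planck constant, J s
M_E = 9.109e-31  # electron mass, kg
EV = 1.602e-19   # joules per electron volt


def well_energy(n, length_m, mass=M_E):
    """Energy in joules of level n in an infinite well of width length_m, eq. (2.11)."""
    if n < 1:
        raise ValueError("the quantum number n starts at 1")
    return n**2 * H**2 / (8 * mass * length_m**2)


L_WELL = 1e-9  # hypothetical well width, m
for n in (1, 2, 3):
    print(f"n = {n}: E = {well_energy(n, L_WELL) / EV:.3f} eV")
```

The levels scale as n², so their separation grows with n, and as 1/L², so a narrower well pushes all the levels upwards.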

The value of A in eq. (2.9) can be obtained by using the normalization condition which 
is expressed by eq. (2.2). We find that $A = \sqrt{2/L}$ and hence 

$$\psi_n = \sqrt{\frac{2}{L}}\,\sin\frac{n\pi x}{L} \qquad (2.12)$$

The wavefunctions $\psi_n$ and the corresponding energies $E_n$, which are often called 
eigenfunctions and energy eigenvalues respectively, describe the quantum state of the particle. 
The forms of $\psi_n$ and the probability density $|\psi_n|^2$ are shown in Figs 2.3(a) and (b) 
respectively for the first three quantum states (n = 1, 2, 3). 
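The normalization leading to eq. (2.12) can be verified numerically. The sketch below integrates the probability density with a simple midpoint rule; the function names, the unit well width and the number of integration steps are arbitrary choices made for illustration.

```python
import math


def psi(n, x, length):
    """Normalized eigenfunction of eq. (2.12) inside the well (0 < x < length)."""
    return math.sqrt(2.0 / length) * math.sin(n * math.pi * x / length)


def probability(n, length, x_lo, x_hi, steps=10000):
    """Midpoint-rule integral of |psi_n|^2 between x_lo and x_hi."""
    dx = (x_hi - x_lo) / steps
    total = 0.0
    for i in range(steps):
        x_mid = x_lo + (i + 0.5) * dx
        total += psi(n, x_mid, length) ** 2
    return total * dx


for n in (1, 2, 3):
    print(n, round(probability(n, 1.0, 0.0, 1.0), 6))  # whole well

# Probability of finding the ground-state particle in the middle half of the well
print(round(probability(1, 1.0, 0.25, 0.75), 4))
```

Integrating over the whole well returns unity for every n, as eq. (2.2) requires, while the last line shows that in the ground state the particle is most likely to be found near the centre of the well.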

This solution for a one-dimensional potential well can be extended quite easily to the more 
realistic case of three dimensions, where, assuming that the sides of the potential well are 
all the same length, the eigenfunctions are given by 

$$\psi_n = (8/L^3)^{1/2} \sin k_1 x \,\sin k_2 y \,\sin k_3 z$$

where 

$$k_1 = n_1\pi/L \qquad k_2 = n_2\pi/L \qquad k_3 = n_3\pi/L$$

The energy eigenvalues are still given by eq. (2.11), that is 

$$E = \frac{n^2 h^2}{8mL^2}$$

but now we have 

$$n^2 = n_1^2 + n_2^2 + n_3^2 \qquad (2.13)$$

FIG. 2.3 Ground state and first two excited states of an electron in a potential well: (a) the electron 
wavefunctions and (b) the corresponding probability density functions. The energies of these three states 
are shown on the right. 



We see that each energy state is defined by a set of three quantum numbers, each of which 
yields a different eigenfunction ψ. From other evidence, it is found that each electron 
eigenfunction has one of two states of opposite spin sign associated with it. These two states are 
described in terms of the quantum number $m_s$ which can take either of the two values +½ 
or -½. Thus four quantum numbers (i.e. $n_1$, $n_2$, $n_3$, $m_s$) are needed to define a quantum state 
completely. In fact the need for four quantum numbers is general in any three-dimensional 
potential, although each potential will give rise to its own unique set of quantum numbers, 
and these may well describe the quantization of quite different physical parameters. For 
example, in a hydrogen atom an electron may be considered to be moving under the 
influence of a single positive point charge. Its quantum state is characterized by the set of four 
numbers n, l, $m_l$ and $m_s$. Here, n is the principal quantum number, mentioned in section 1.4, 
which determines the electron energy, l is the orbital angular momentum quantum number, 
which determines the angular momentum of the electron, while $m_l$ is the magnetic quantum 
number, which determines the orientation of the angular momentum vector. Only the spin 
quantum number is the same in these two cases. 

Whilst solutions of the Schrodinger equation tell us what states are available to a single 
electron, we still need to know which of these states are occupied, especially if we have a 
situation where a number of electrons are involved. At first sight we might expect that all 
the electrons would occupy the lowest energy states possible. The implications of this are 
that in a multielectron atom, for example, the electrons would all tend to occupy the same 
energy level. Since the chemical properties of an atom are determined by the types of levels 
the electrons occupy, this assumption would give rise to the atoms of the various elements 
all having very similar chemical properties. Manifestly this is not the case! In fact electrons 
are found to obey the Pauli exclusion principle. This forbids any two electrons having the 
same set of quantum numbers. Thus if we have several electrons (assumed to be 
non-interacting for simplicity) within a three-dimensional potential well, not more than two of them 
can have the same values for $n_1$, $n_2$ and $n_3$, and in that case they must then have different 
values of $m_s$. 
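The filling rule just described can be sketched as a short enumeration. The code below lists the states of the cubic well in order of increasing n² (eq. 2.13), places two electrons of opposite spin in each spatial state and returns the occupied set of quantum numbers. The function names and the cut-off `n_max` are illustrative assumptions, and the electrons are treated as non-interacting, as in the text.

```python
from itertools import product


def n_squared(state):
    """n^2 = n1^2 + n2^2 + n3^2 for a state (n1, n2, n3, ...), eq. (2.13)."""
    n1, n2, n3 = state[:3]
    return n1 * n1 + n2 * n2 + n3 * n3


def fill_well(n_electrons, n_max=6):
    """Occupied (n1, n2, n3, m_s) states for n_electrons, lowest energy first."""
    orbitals = sorted(
        product(range(1, n_max + 1), repeat=3),
        key=lambda q: (n_squared(q), q),
    )
    # Each spatial state holds two electrons of opposite spin (Pauli principle).
    states = [(n1, n2, n3, ms) for (n1, n2, n3) in orbitals for ms in (0.5, -0.5)]
    occupied = states[:n_electrons]
    # The cubic cut-off n_max omits states with n^2 >= (n_max + 1)^2 + 2,
    # so the ordering is only trustworthy below that limit.
    limit = (n_max + 1) ** 2 + 2
    if n_electrons > len(states) or (occupied and n_squared(occupied[-1]) >= limit):
        raise ValueError("increase n_max")
    return occupied


for count in (2, 8, 9):
    top = fill_well(count)[-1]
    print(count, "electrons: highest occupied state", top, "with n^2 =", n_squared(top))
```

States with the same n² but different ($n_1$, $n_2$, $n_3$) are degenerate in energy, so the order in which they are filled within a given n² is arbitrary here.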



2.2 Energy bands in solids 

As isolated atoms are brought together to form a solid various interactions occur between 
neighbouring atoms. The forces of attraction and repulsion between atoms find a balance 
at the proper interatomic spacing for the crystal. In the process important changes occur in 
the electron energy levels, which result in the varied electrical properties of different classes 
of solids. 

Qualitatively, we can see that as the atoms are brought closer together the application 
of the Pauli principle becomes important. When atoms are isolated, as in a gas, there is 
no interaction of the electron wavefunctions: each atom can have its electrons in identi- 
cal energy levels. As the interatomic spacing decreases, however, the electron wave- 
functions begin to overlap and, to avoid violating the Pauli principle, there is a splitting 
of the discrete energy levels of the isolated atoms into new levels belonging to the col- 
lection of atoms as a whole. In a solid many atoms are brought together so that the split 
energy levels form a set of bands of very closely spaced levels with forbidden energy gaps 
between them, as illustrated in Fig. 2.4. The lower energy bands are occupied by electrons 
first; those energy bands which are completely occupied (i.e. full) are not important, in 
general, in determining the electrical properties of the solid. On the other hand, the elec- 
trons in the higher energy bands of the solid are extremely important in determining many 
of the physical properties of the solid. In particular the two highest energy bands, called 
the valence and conduction bands, are of crucial importance in this respect, as is the 
forbidden energy region between them which is referred to as the energy gap, $E_g$. In 
different solids the valence band might be completely filled, nearly filled or only half filled with 
electrons, while the conduction band is never more than slightly filled. The extent to which 
these bands are, or are not, occupied and the size of the energy gap determine the nature 
of a given solid. 

FIG. 2.4 Schematic representation of how the energy levels of interacting atoms form energy bands in 
solids. 

We may further reinforce our model by considering that in an ideal crystalline solid the 
atoms are arranged in a perfectly periodic array. The potential experienced by an electron 
in the solid is correspondingly spatially periodic, so that, after a distance in the crystal equal 
to the lattice spacing, the potential V repeats itself, that is 

$$V(x) = V(x + a) = V(x + 2a) = \ldots$$

where a is the periodicity of the lattice. 

We could now attempt to apply the Schrodinger equation to the problem of electron motion 
within this system. This is difficult unless we choose a very simple array of atoms. One such 
approach, the Kronig-Penney model (ref. 2.3), considers a single electron moving within 
a one-dimensional line of atoms represented by a periodic array of rectangular potential wells. 
Application of the Schrodinger equation in this case shows that not every value of electron 
energy is allowed. In fact we find that there are ranges of allowed energies separated by ranges 
of disallowed energies; that is, the electrons in a solid can occupy certain bands of energy 
levels which are separated by forbidden energy gaps. 

The discontinuities between allowed and forbidden energy values occur at values of the 
wavevector k given by k = ±nπ/a, where n is an integer (see Problem 2.6). The E-k curve 
then takes the appearance shown in Fig. 2.5(a), in contrast to the smooth dashed curve which 
is the E-k curve for a free electron (V = 0) - see eq. (2.8). The discontinuities arise from 
the interaction of the electrons with the periodic potential V; the corresponding energy band 
arrangement is shown to the right of Fig. 2.5(a). 
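The appearance of allowed and forbidden energy ranges can be reproduced with a few lines of code. The sketch below does not solve the rectangular-well problem described in the text. Instead it assumes the widely quoted limiting form of the Kronig-Penney result, in which the barriers become infinitely thin and high, giving cos(ka) = cos(αa) + P sin(αa)/(αa) with α² = 2mE/ħ² as in eq. (2.8). The barrier strength P and all function names are assumptions made for illustration. An energy is allowed only if the right-hand side lies between -1 and +1.

```python
import math


def kp_rhs(alpha_a, p):
    """Right-hand side of cos(ka) = cos(alpha*a) + P*sin(alpha*a)/(alpha*a)."""
    return math.cos(alpha_a) + p * math.sin(alpha_a) / alpha_a


def is_allowed(alpha_a, p):
    """True if a real wavevector k exists, i.e. the energy lies in a band."""
    return abs(kp_rhs(alpha_a, p)) <= 1.0


def band_edges(p, alpha_a_max=4 * math.pi, steps=40000):
    """Scan alpha*a on a grid and return (start, end) of each allowed band."""
    edges = []
    start = None
    prev = 0.0
    for i in range(1, steps + 1):
        x = alpha_a_max * i / steps
        allowed = is_allowed(x, p)
        if allowed and start is None:
            start = x                    # entering an allowed band
        elif not allowed and start is not None:
            edges.append((start, prev))  # leaving the band at the previous point
            start = None
        prev = x
    if start is not None:
        edges.append((start, prev))
    return edges


P = 1.5 * math.pi  # assumed barrier strength (dimensionless)
for lo, hi in band_edges(P):
    print(f"allowed: {lo / math.pi:.3f} pi < alpha*a < {hi / math.pi:.3f} pi")
```

In this limiting model the upper edge of each band falls at αa = nπ, where cos(ka) = ±1, which corresponds to the discontinuities at k = ±nπ/a described in the text.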
FIG. 2.5(a) Relationship of E-k for electrons subjected to the potential distribution of the Kronig-Penney 
model and the corresponding energy band structure. The E-k relationship for free electrons is shown for 
comparison. 
The region in Fig. 2.5(a) for which -π/a < k < π/a is called the first Brillouin zone. It is 
often convenient to redraw Fig. 2.5(a) by translating the segments of the E-k curve so that 
they all lie within this range. This is shown in Fig. 2.5(b). (There is more to this procedure 
than simply pictorial convenience, as it has a theoretical justification as well, see ref. 2.1f, 
section 2.2.) In three dimensions the situation is obviously more complicated, but it turns 
out that diagrams similar to Fig. 2.5(b) can be drawn corresponding to different directions 
in the crystal. The first Brillouin zone in general has a complicated shape which depends 
upon the crystal structure being considered. However, its boundaries still lie close to π/a, 
although the parameter a has now to be interpreted in terms of the crystal unit cell 
dimensions. 



2.2.1 Conductors, semiconductors and insulators 

In real crystals the E-k relationship is much more complicated as we can see from Fig. 2.6, 
which shows the relationships for silicon and gallium arsenide. These depend on the 
orientation of the electron wavevector with respect to the crystallographic axes, since interatomic 
distances and the internal potential energy distribution also depend on direction in the crystal. 
FIG. 2.5(b) Reduced zone representation of the E-k relationship shown in Fig. 2.5(a). This representation 
is constructed by translating the segments of the E-k curve so that they all lie between k = -π/a and 
k = +π/a, which comprises the first Brillouin zone. 
FIG. 2.6 Relationship of E-k for real solids: (a) silicon (which has an indirect bandgap) and (b) gallium 
arsenide (which has a direct bandgap). The figure shows the conduction and valence bands and the energy 
gap $E_g$ between them. Note that (i) k is specified in different crystallographic directions to the left and right, 
and (ii) there are holes present with different effective masses (sections 2.2.1 and 2.3). 



However, the basic effect is still energy band formation. One point that arises is that the 
maximum of the valence band does not always occur at the same k value as the minimum 
of the conduction band. We speak of a direct bandgap semiconductor when they do and an 
indirect bandgap semiconductor when they do not. Thus silicon has an indirect bandgap 
whilst gallium arsenide has a direct bandgap, which, as we shall see, leads to differing optical 
properties. For some purposes, however, we may regard the conduction and valence bands 
as being similar to the bands shown in Fig. 2.5(a). 

We may now give a qualitative explanation of why the electrical conductivities of various 
types of solid are different in terms of the energy band model. The differences arise from 
the extent to which the energy levels in the valence and conduction bands are filled by 
electrons. The electrons occupy the allowed states (energy levels) in the energy bands, starting 
from the lowest, until they are all accommodated (one electron per state). If the electrons 
in the solid respond to an externally applied electric field, and thereby contribute to the 
conductivity, they will acquire energy from the field and move to higher energy levels; that 
is, the field accelerates the electrons and therefore increases their energies. This can only 
occur if there are unoccupied, higher energy levels immediately available to the electrons, 
either within the same energy band, or in the next higher band providing that the energy gap 
is very small. 

In metallic conductors, the uppermost occupied band is only partially filled as shown in
Fig. 2.7(a), or there is band overlap (Fig. 2.7b), and electrons can gain energy from an
external field quite easily, resulting in high conductivity.

FIG. 2.7 Schematic representation of the energy bands in various materials: (a) a metal with partially filled
valence band, e.g. monovalent metals; (b) a metal with two overlapping partially filled bands, e.g. divalent
metals; (c) an insulator; and (d) an intrinsic semiconductor. In this, and succeeding diagrams, shading is
used to signify occupied electron energy levels.

In insulators the upper occupied band, the valence band, is completely filled with electrons
as shown in Fig. 2.7(c). The nearest empty states are in the conduction band, but these are
separated from the valence band by a large energy gap, large, that is, compared with the
average thermal energy of the electrons, $kT$. At room temperature $kT$ is about 1/40 eV, while
typically $E_g \approx 4$ eV. There are, therefore, very few electrons that can respond to a field, and
virtually no conduction occurs.
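
The comparison between the energy gap and the thermal energy can be made concrete with a short
numerical sketch. The Python fragment below evaluates kT at room temperature and the simple
Boltzmann factor exp(-E_g/kT) for gaps of 4 eV and 1 eV. The function names are illustrative
only, and the bare Boltzmann factor is used here as an order-of-magnitude guide under that
assumption; it is not the full expression for the carrier concentration.

```python
import math

K_B_EV = 8.617333e-5  # Boltzmann constant in eV per kelvin


def thermal_energy_ev(temperature_k):
    """Return kT in electron volts."""
    return K_B_EV * temperature_k


def boltzmann_factor(gap_ev, temperature_k):
    """Return exp(-E_g / kT), a rough measure of how hard a gap is to cross."""
    return math.exp(-gap_ev / thermal_energy_ev(temperature_k))


kT = thermal_energy_ev(300.0)
print(f"kT at 300 K = {kT:.4f} eV")
for gap in (4.0, 1.0):  # insulator-like and semiconductor-like gaps
    print(f"E_g = {gap} eV: E_g/kT = {gap / kT:.0f}, "
          f"exp(-E_g/kT) = {boltzmann_factor(gap, 300.0):.1e}")
```

The factor for a 4 eV gap is smaller than that for a 1 eV gap by many tens of orders of
magnitude, which is the quantitative content of the statement that virtually no conduction
occurs in an insulator.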

A similar situation arises in intrinsic semiconductors where at low temperatures the valence 
band is full and the conduction band is empty. In this case, however, the energy gap is
sufficiently small (about 1 eV) that some electrons are excited across it at higher temperatures.
Clearly, the electrons excited into the conduction band can contribute to current flow. 
Similarly, as there are now vacant states in the valence band (Fig. 2.7d), the electrons in that 
band can also respond to an applied field and contribute to the current flow. It is difficult 
to evaluate this contribution in terms of electron movement, however, as there are a very 
large number of electrons and a small number of unoccupied states in the band. It turns out 
that the contribution to the current flow from all the electrons in the nearly full valence band 
is the same as that which would arise from the presence of a small number of fictitious
positive charge carriers called holes in an otherwise empty band. The number of holes, in fact,
is simply equal to the number of empty states in the valence band. Indeed, for many
purposes we may regard a hole as an unfilled state. Holes can be regarded as particles which
behave in general in a similar way to electrons, apart from having a charge of opposite sign. 

The concept of how holes arise in a solid is most easily illustrated by considering the energy
band scheme of an intrinsic semiconductor. At any temperature above absolute zero some
electrons will be excited from the valence band to the conduction band as a result of their
thermal energy. When electrons make such transitions, as illustrated in Fig. 2.8(a), empty
states are left in the valence band and we say that electron-hole pairs have been created.

FIG. 2.8 Creation of electron-hole pairs by the thermal excitation of electrons from the valence band to
the conduction band (a) and electron and hole drift in a semiconductor on application of a field $\mathcal{E}_x$ (b).

If an electric field is now applied to such a solid, electrons in the conduction band and
holes in the valence band drift in opposite directions as shown in Fig. 2.8(b) and both thus
contribute to the current because of their opposite charges.



2.3 Electrical conductivity



The charge carriers in a solid are in constant thermal motion. In the absence of external fields 
this motion is quite random resulting from the carriers being scattered by collisions with 
lattice atoms, impurities and crystal defects. As the motion is random there is no net dis- 
placement of charge in any direction and hence no net current flow. 

If an electric field $\mathcal{E}_x$ is applied then the individual electrons will experience a force $-e\mathcal{E}_x$
(where we have taken e to represent the magnitude of the electronic charge). While this force
may be insufficient to alter the random path of an individual electron appreciably, the effect
when averaged over all the electrons is to generate a net motion of the electrons in the
negative x direction. If the total momentum of the group of electrons in the x direction is $p$, then
we may write

$$\frac{dp}{dt} = -ne\mathcal{E}_x$$    (2.14)



where n is the electron density. Equation (2.14) indicates a steady acceleration of the
electrons along the x direction. However, they lose momentum in collisions with defects in the crystal
lattice, such as lattice vibrations (or phonons) and impurities, so that a steady state is reached.
In the steady state the rate at which momentum is acquired by the electrons due to the
acceleration caused by the electric field is balanced by the rate of loss of momentum due to
collisions. Consequently the group of electrons acquires a constant drift velocity $v_D$. Now if the
electron collisions are random the probability of each electron colliding at any instant of time
is constant. Thus if we have a group of $n_0$ electrons at time $t = 0$, and at time $t$,
$n(t)$ of these have not undergone a collision, then we may write

$$\frac{dn(t)}{dt} = -\frac{1}{\tau}\,n(t)$$    (2.15)

where $\tau$ is a constant of proportionality.

The solution to eq. (2.15) is $n(t) = n_0\exp(-t/\tau)$. We see that $\tau$, which is called the mean
free time or relaxation time, is the mean time between scattering events, and that the
probability that an electron has a collision in the time interval $dt$ is $dt/\tau$. Thus the change in
momentum due to collisions in time $dt$ is $dp = -p\,dt/\tau$.

The rate of loss of momentum due to collisions, which may be regarded as generating a
decelerating or resistive force, is then $dp/dt = -p/\tau$. Hence in the steady state, from eq. (2.14)
we have

$$-\frac{p}{\tau} - ne\mathcal{E}_x = 0$$    (2.16)

The average momentum per electron, $\bar{p}$, is $p/n$, which from eq. (2.16) is given by

$$\bar{p} = -e\tau\mathcal{E}_x$$    (2.17)
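
The approach to the steady state can be checked numerically. The sketch below integrates the
momentum balance dp/dt = -neE - p/tau (the field term of eq. (2.14) plus the collision loss
term) with a simple forward-Euler step, starting from p = 0, and compares the result with the
steady-state value p = -ne(tau)E implied by eq. (2.16). The function names, the step size and
the unit field of 1 V/m are assumptions made for illustration; the electron density and mean
free time are the copper values quoted in Example 2.1.

```python
E_CHARGE = 1.602e-19  # magnitude of the electronic charge in coulombs


def steady_state_momentum(n, field, tau):
    """Steady-state total momentum from eq. (2.16): p = -n e tau E."""
    return -n * E_CHARGE * tau * field


def integrate_momentum(n, field, tau, steps_per_tau=1000, n_tau=10):
    """Forward-Euler integration of dp/dt = -n e E - p/tau from p(0) = 0."""
    dt = tau / steps_per_tau
    p = 0.0
    for _ in range(steps_per_tau * n_tau):
        p += dt * (-n * E_CHARGE * field - p / tau)
    return p


n_cu, tau_cu, field = 8.4e28, 2.6e-14, 1.0  # m^-3, s, V/m (field is hypothetical)
p_numeric = integrate_momentum(n_cu, field, tau_cu)
p_steady = steady_state_momentum(n_cu, field, tau_cu)
print(f"after 10 relaxation times p/p_steady = {p_numeric / p_steady:.5f}")
```

After a few relaxation times the integrated momentum is indistinguishable from the
steady-state value, which is why the drift velocity can be treated as constant.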

As expected, eq. (2.17) indicates that the electrons on average have a constant net drift
velocity of

$$v_D = \frac{\bar{p}}{m_e^*} = -\frac{e\tau\mathcal{E}_x}{m_e^*}$$    (2.18)

where $m_e^*$ is the effective mass of the electron (see below).

As the electron concentration is $n$, the current density resulting from this drift is

$$J = -nev_D$$    (2.19)

Hence combining eqs (2.18) and (2.19) we have

$$J = \frac{ne^2\tau}{m_e^*}\,\mathcal{E}_x$$    (2.20)

or

$$J = \sigma\mathcal{E}_x$$    (2.21)

Equation (2.21) is, of course, Ohm's law and $\sigma$, the electrical conductivity, is given by

$$\sigma = \frac{ne^2\tau}{m_e^*} = ne\mu_e$$    (2.22)

The quantity $\mu_e$ ($= e\tau/m_e^*$) in eq. (2.22) is called the electron mobility. It describes the ease
with which the electrons drift in the material and it is a very important quantity in
characterizing semiconductor materials.
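
Equations (2.18) and (2.22) translate directly into a few lines of code. The sketch below
computes the mobility, drift velocity and conductivity from the mean free time, using the
copper data quoted in Example 2.1 (n = 8.4 x 10^28 m^-3, tau = 2.6 x 10^-14 s, m* = m). The
function names and the rounded physical constants are choices made here for illustration.

```python
E_CHARGE = 1.602e-19    # magnitude of the electronic charge in coulombs
M_ELECTRON = 9.109e-31  # free electron mass in kilograms


def mobility(tau, m_eff=M_ELECTRON):
    """Electron mobility mu_e = e tau / m* in m^2 V^-1 s^-1."""
    return E_CHARGE * tau / m_eff


def drift_velocity(field, tau, m_eff=M_ELECTRON):
    """Drift velocity from eq. (2.18): v_D = -e tau E / m*."""
    return -mobility(tau, m_eff) * field


def conductivity(n, tau, m_eff=M_ELECTRON):
    """Conductivity from eq. (2.22): sigma = n e^2 tau / m* = n e mu_e."""
    return n * E_CHARGE * mobility(tau, m_eff)


sigma_cu = conductivity(8.4e28, 2.6e-14)
print(f"mu_e  = {mobility(2.6e-14):.2e} m^2/(V s)")
print(f"sigma = {sigma_cu:.2e} ohm^-1 m^-1")
```

With these inputs the conductivity comes out of the order of 6 x 10^7 ohm^-1 m^-1; the last
digit depends on the rounding of the constants used.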

The two basic types of scattering that influence electron and hole mobilities are known 
as lattice scattering and impurity scattering. In the former a carrier moving through the crystal
encounters atoms which are out of their normal positions because of thermal vibrations. It
is to be expected that the frequency of such events would rise as the temperature rises, leading 
to a decrease in the mobility with increasing temperature. On the other hand, scattering from 
lattice defects such as ionized impurities (including dopant atoms, see section 2.4.2) should 
be relatively independent of temperature. Thus we would expect lattice scattering to dom- 
inate at high temperatures and impurity scattering to dominate at low temperatures. 

The mobility, as we can see from eq. (2.18), can be defined as the drift velocity per unit 
electric field. Equation (2.20) can be rewritten, using the mobility, as 

$$J = ne\mu_e\mathcal{E}_x$$    (2.23)

In semiconductors we must include current flow due to both electrons and holes and the
total current density is

$$J = (ne\mu_e + pe\mu_h)\mathcal{E}_x = \sigma\mathcal{E}_x$$    (2.24)

where $\mu_h$ is the mobility of the holes, and is given by $\mu_h = e\tau/m_h^*$, where $m_h^*$ is the effective
mass of the holes.
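
As a minimal illustration of eq. (2.24), the following sketch evaluates the conductivity of a
semiconductor with both carrier types present. The numbers are the intrinsic silicon data
quoted in Example 2.1 (n = p = 1.6 x 10^16 m^-3, electron mobility 0.135 and hole mobility
0.048 m^2 V^-1 s^-1); the function names and the field value are hypothetical.

```python
E_CHARGE = 1.602e-19  # magnitude of the electronic charge in coulombs


def semiconductor_conductivity(n, p, mu_e, mu_h):
    """Two-carrier conductivity from eq. (2.24): sigma = e (n mu_e + p mu_h)."""
    return E_CHARGE * (n * mu_e + p * mu_h)


def current_density(sigma, field):
    """Ohm's law in the form J = sigma E."""
    return sigma * field


n_i = 1.6e16  # intrinsic carrier concentration of silicon, m^-3
sigma_si = semiconductor_conductivity(n_i, n_i, 0.135, 0.048)
print(f"sigma_Si = {sigma_si:.2e} ohm^-1 m^-1")
print(f"J at 100 V/m = {current_density(sigma_si, 100.0):.2e} A m^-2")
```

The electron term contributes roughly three-quarters of the total here, simply because the
electron mobility is larger than the hole mobility.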

The concept of effective mass arises when we consider the motion of an electron through 
a crystal when a field is applied. The situation is not governed simply by the laws of clas- 
sical physics, because the electron is influenced not only by the external field but also by 
an internal field, produced by the other electrons, and by the periodic potential of the atoms 
in the crystal. Despite this we may use Newton's law to evaluate the acceleration of the elec- 
tron providing we accept that the electron will exhibit an effective mass m* which is dif- 
ferent from the mass m of a free electron in vacuum. The value of the effective mass depends 
on the energy of the electron within an energy band and is given by (ref. 2.4) 

$$m^* = \hbar^2\left(\frac{d^2E}{dk^2}\right)^{-1}$$    (2.25)

For an electron at the top of an energy band it can be seen from Fig. 2.5(a) that $d^2E/dk^2$,
and hence also the effective mass, is negative. Such an electron will be accelerated by a field
in the reverse direction of that expected for a negatively charged electron. The existence of
electrons with negative effective masses is closely linked with the concept of holes, which
have positive effective masses, $m_h^*$ (ref. 2.5).
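
The curvature definition of eq. (2.25) is easy to explore numerically. The sketch below
estimates d^2E/dk^2 with a central finite difference and applies it to two model E-k curves:
the free-electron parabola, for which the result should be the free electron mass, and a
cosine-shaped band, which has positive curvature at the bottom and negative curvature at the
top. The cosine band, its 1 eV width and the 0.5 nm lattice spacing are hypothetical choices
made purely for illustration, not data from the text.

```python
import math

HBAR = 1.0546e-34       # reduced Planck constant in J s
M_ELECTRON = 9.109e-31  # free electron mass in kg
A = 5.0e-10             # hypothetical lattice spacing in metres
WIDTH = 1.602e-19       # hypothetical band width (1 eV) in joules


def effective_mass(energy, k, dk):
    """m* = hbar^2 / (d2E/dk2), using a central second difference."""
    curvature = (energy(k + dk) - 2.0 * energy(k) + energy(k - dk)) / dk**2
    return HBAR**2 / curvature


def free_electron(k):
    """Free-electron parabola E = hbar^2 k^2 / 2m."""
    return HBAR**2 * k**2 / (2.0 * M_ELECTRON)


def cosine_band(k):
    """Illustrative band with its minimum at k = 0 and maximum at k = pi/a."""
    return 0.5 * WIDTH * (1.0 - math.cos(k * A))


dk = 1.0e7  # finite-difference step in m^-1
print(effective_mass(free_electron, 1.0e9, dk) / M_ELECTRON)
print(effective_mass(cosine_band, 0.0, dk) / M_ELECTRON)
print(effective_mass(cosine_band, math.pi / A, dk) / M_ELECTRON)
```

The sign change between the bottom and the top of the model band is the numerical
counterpart of the negative effective mass discussed above.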

As mentioned at the start of section 2.2.1 the E-k relationships depend strongly on direction
within the crystal. Hence it might appear from eq. (2.25) that we should have an
anisotropic effective mass. In general, however, material parameters such as electrical con- 
ductivity that depend on effective mass are found to be isotropic. This is because there are 
often several minima in the E-k diagram, each giving rise to an anisotropic effective mass, 
but each being differently orientated so that when an average is taken an isotropic result is 
obtained. In addition, the use of effective mass in different contexts (e.g. conductivity and 
density of states (section 2.5)) necessitates taking the average in different ways resulting in 
different values for the effective mass. Thus we speak of the 'conductivity effective mass' 
and the 'density of states effective mass'. In silicon, for example, for electrons these are 0.26m
and 0.55m respectively.

To evaluate conductivities from eq. (2.22), in addition to carrier mobilities we also require
values for the carrier concentrations. This is considered in more detail in section 2.5, where
we shall find that carrier concentrations are strongly temperature dependent. As the
mobilities, as mentioned above, are also temperature dependent, the conductivity varies with
temperature in a quite complex way. This variation, however, is the basis of one method of
measuring the excitation energies of the impurities (section 2.4.2) and energy gaps (ref. 2.6).



2.4 Semiconductors



2.4.1 Intrinsic semiconductors 



A perfect semiconductor crystal containing no impurities or lattice defects is called an
intrinsic semiconductor. In such a material, there are no charge carriers at absolute zero but as
the temperature rises electron-hole pairs are generated as explained in section 2.2.1. As the
carriers are generated in pairs the concentration n of electrons in the conduction band equals
the concentration p of holes in the valence band. Thus we have

$$n = p = n_i$$

where $n_i$ is the intrinsic carrier concentration. The value of $n_i$ varies exponentially with
temperature (section 2.5), but at room temperature it is usually not very large. For example, in
silicon $n_i \approx 1.6 \times 10^{16}$ m$^{-3}$ at room temperature, whereas there are about $10^{28}$ free electrons
per cubic metre in a typical metal. Consequently the conductivity of metals is very much
greater than that of intrinsic semiconductors as illustrated in Example 2.1.

EXAMPLE 2.1 Electrical conductivity of metals and semiconductors

We may compare the electrical conductivity of copper and intrinsic silicon from the
following data: for copper we have a density of $8.93 \times 10^{3}$ kg m$^{-3}$, an atomic mass
number of 63.54, a mean free time between collisions of $2.6 \times 10^{-14}$ s and we assume
$m_e^* = m$; for intrinsic silicon we have $n = p = n_i = 1.6 \times 10^{16}$ m$^{-3}$, an electron mobility
of 0.135 m$^2$ V$^{-1}$ s$^{-1}$ and a hole mobility of 0.048 m$^2$ V$^{-1}$ s$^{-1}$.

In copper we assume that the free electron concentration is the same as the number
of atoms per unit volume, which we may determine from Avogadro's number to be

$$n = \frac{6 \times 10^{26} \times 8.93 \times 10^{3}}{63.54} = 8.4 \times 10^{28}\ \mathrm{m^{-3}}$$

Then from eq. (2.22)

$$\sigma_{Cu} = 6.4 \times 10^{7}\ \Omega^{-1}\,\mathrm{m^{-1}}$$

For intrinsic silicon, using eq. (2.24) we have

$$\sigma_{Si} = en_i(\mu_e + \mu_h) = 1.6 \times 10^{-19} \times 1.6 \times 10^{16} \times (0.135 + 0.048)$$

or

$$\sigma_{Si} = 4.7 \times 10^{-4}\ \Omega^{-1}\,\mathrm{m^{-1}}$$

As at a given temperature there is a steady-state carrier concentration, there must be a
recombination of electron-hole pairs at the same rate as that at which the thermal
generation occurs. Recombination takes place when an electron in the conduction band makes a
transition into a vacant state in the valence band. The energy released in the recombination,
which is approximately equal to $E_g$, may be emitted as a photon or given up as heat to the
crystal lattice in the form of quantized lattice vibrations, which are called phonons,
depending on the nature of the recombination mechanism. When a photon is released, the process
is called radiative recombination. The absence of photon emission indicates a non-radiative
process, in which lattice phonons are created.

We may distinguish between two types of recombination process which we term 'band-to-band'
and 'defect centre' recombinations. (Some texts refer to these as 'direct' and 'indirect'
transitions respectively. This terminology has not been adopted here to avoid confusion
with direct and indirect bandgaps, ref. 2.7.) In the band-to-band process, which is shown in
Fig. 2.9(a), an electron in the conduction band makes a transition directly to the valence band
to recombine with a hole. In the defect centre process, the recombination takes place via
recombination centres or traps. These are energy levels in the forbidden energy gap which
are associated with defect states caused by the presence of impurities or lattice
imperfections. Any such defect state can act as a recombination centre if it is capable of trapping a
carrier of one type and subsequently capturing a carrier of the opposite type, thereby enabling
them to recombine.

The precise mechanism of a defect centre recombination event depends on the nature 
and energy of the defect state. One such process is illustrated in Fig. 2.9(b). In the first 
step (i) an electron is trapped by the recombination centre, which subsequently captures 
a hole (ii). When both of these events have occurred the net result is the annihilation of 
an electron-hole pair leaving the centre ready to participate in another recombination
event (iii). The energy released in the recombination is given up as heat to the
lattice. 



FIG. 2.9 Illustration of (a) band-to-band recombination and (b) recombination via a defect centre. The
first step in (b) is (i) the trapping of an electron followed by (ii) hole capture. This results in the annihilation
of an electron-hole pair leaving the centre ready to participate in another recombination (iii).

If the thermal generation rate is $g_i$ and the recombination rate is $r_i$ then, in
equilibrium,

$$g_i = r_i$$    (2.26)

Both rates are temperature dependent so that if the temperature is raised, $g_i$ increases and a
new value of carrier concentration $n_i$ is established such that the higher recombination rate
just balances generation.

At any temperature the probability of an electron recombining is proportional to the number
of holes present; thus electrons will disappear at a rate proportional to the product of the
electron and hole concentrations. Therefore we may write, in general,

$$r_i = Bnp = g_i$$    (2.27)

where B is a constant of proportionality which depends on the recombination mechanism
taking place (see also section 4.6). For an intrinsic material $n = p = n_i$ and $r_i = Bn_i^2$.

2.4.2 Extrinsic semiconductors 

The number of charge carriers in a semiconductor can be vastly increased by introducing
appropriate impurities into the crystal lattice. In this process, which is called doping, a crystal
can be altered so that it has a predominance of either electrons or holes; that is, it can be
made either n-type (where the majority carriers are negative electrons and the minority
carriers are holes) or p-type (where the majority carriers are positive holes). In doped
semiconductors the carrier concentrations are no longer equal and the material is said to be
extrinsic.

In doping tetravalent elements, for example silicon, impurities from column V of the
periodic table such as phosphorus and arsenic or from column III such as boron and indium are
used to produce n-type and p-type semiconductors respectively. The reasons for this are as
follows. When intrinsic silicon is doped with phosphorus, for example, the phosphorus atoms
are found to occupy atomic sites normally occupied by silicon atoms as shown in Fig. 2.10(a).
Since the silicon atoms are tetravalent only four of the five valence electrons of phosphorus
are used in forming covalent bonds (ref. 2.8), leaving one electron weakly bound to its parent
atom. This electron is easily freed; that is, it can easily be excited into the conduction band.
Therefore in the energy band model the energy levels for the 'extra' electrons associated
with these impurities lie at $E_d$, just beneath the conduction band as shown in Fig. 2.10(b).
Such impurities are referred to as donors, and the energy levels at $E_d$ as donor levels, since
they donate electrons to the conduction band. The energy required to excite an electron from
the donor levels into the conduction band is $E_D$, which equals $(E_c - E_d)$, where $E_c$ is the energy
of the bottom of the conduction band. If, as is frequently the case, we take the energy $E_v$ at
the top of the valence band to be zero, then $E_g = E_c$ and $E_D$ equals $(E_g - E_d)$. At absolute
zero the donor levels are all occupied but, because $E_D$ is so small (about 0.04 eV), even at
moderately low temperatures most of the electrons are excited into the conduction band,
thereby increasing the free electron concentration and the conductivity of the material (see
section 2.3).

FIG. 2.10 Schematic representation of an n-type semiconductor: (a) the crystal lattice in which a silicon
atom has been replaced by a phosphorus impurity atom and (b) the energy levels of the donor impurity
atoms (the diagram shows that some of the impurities have 'donated' an electron to the conduction band).

We can estimate $E_D$ as follows. If a phosphorus impurity atom loses its fifth valence
electron it is left with a net positive charge of +e (the impurity is said to have been ionized). It
can be imagined, therefore, that this electron is bound to its parent atom in a situation which
is similar to that found in the hydrogen atom, where a charge of +e binds an electron to the
nucleus. The ionization energy of the hydrogen atom is 13.6 eV (see Example 1.5), but in
the case under discussion here there are two important differences arising from the fact that
the electron moves in a solid. First we must use the effective mass $m_e^*$ rather than the free
electron mass. Secondly the relative permittivity of the semiconductor must be included in
the derivation of the electron energy levels. This is because the electron orbit is large enough
to embrace a significant number of silicon atoms so that the electron may be considered to
be moving in a dielectric medium of relative permittivity $\varepsilon_r$. Therefore, from eq. (1.43), the
excitation energy $E_D$ is given by

$$E_D = 13.6\,\frac{m_e^*}{m}\left(\frac{1}{\varepsilon_r}\right)^2\ \mathrm{eV}$$    (2.28)



FIG. 2.11 Schematic representation of a p-type semiconductor: (a) the crystal lattice with trivalent
impurities (e.g. boron) and (b) the energy levels of the acceptor impurity atoms (some impurities have 'accepted'
electrons from the valence band).

Suppose, on the other hand, that silicon is doped with boron. Again it is found that the
impurity atoms occupy sites normally occupied by silicon atoms as shown in Fig. 2.11(a).
In this case, however, there is one electron too few to complete the covalent bonding. At
temperatures above absolute zero an electron from a neighbouring silicon atom can move
to the impurity to complete the bonding there but, by so doing, it leaves a vacant state in
the valence band thereby creating an additional hole. For this reason the trivalent impurities
are referred to as acceptors as they accept electrons excited from the valence band. It
is convenient to regard this situation as a negatively ionized acceptor atom with a positive
hole orbiting around it analogous to the situation described above. The energy $E_A$, which
equals $E_a - E_v$, required to 'free' the hole from its parent impurity can be estimated as above.
An average value of the effective mass of holes in silicon is 0.33m and, from eq. (2.28),
$E_A = 0.032$ eV. In reality, of course, $E_A$ is the energy required to excite an electron from the
valence band to the acceptor energy levels, which lie just above the valence band as
illustrated in Fig. 2.11(b).
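
The hydrogen-like estimate of eq. (2.28) is a one-line calculation. The sketch below
evaluates it for silicon using the values quoted in the text: an electron effective mass of
0.26m, a hole effective mass of 0.33m and a relative permittivity of 11.8. The function name
is an illustrative choice, and the model is of course only the simple scaling argument
described above.

```python
RYDBERG_EV = 13.6  # ionization energy of the hydrogen atom in eV


def impurity_ionization_energy(m_eff_ratio, rel_permittivity):
    """Hydrogen-like estimate of eq. (2.28): 13.6 (m*/m) / eps_r^2, in eV."""
    return RYDBERG_EV * m_eff_ratio / rel_permittivity**2


EPS_R_SILICON = 11.8
donor = impurity_ionization_energy(0.26, EPS_R_SILICON)     # electrons
acceptor = impurity_ionization_energy(0.33, EPS_R_SILICON)  # holes
print(f"E_D = {donor:.3f} eV, E_A = {acceptor:.3f} eV")
```

Both energies are close to kT at room temperature (about 1/40 eV), which is why such
impurities are almost completely ionized under ordinary conditions.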



EXAMPLE 2.2 Ionization energy of donor impurities

We may estimate the energy required to excite electrons from the donor levels to the
conduction band in silicon given that $m_e^* = 0.26m$ and the relative permittivity is 11.8.
From eq. (2.28) we then have

$$E_D = 13.6 \times 0.26 \times \left(\frac{1}{11.8}\right)^2 = 0.025\ \mathrm{eV}$$

(We may compare this value with the following experimental values for various donor
impurities in silicon: P 0.045 eV; As 0.05 eV; and Sb 0.04 eV.)

In the compound semiconductor gallium arsenide, gallium is from group III of the
periodic table, while arsenic is from group V (such semiconductors are often referred to as group
III-V compounds; other members of this group are GaP, InP and AlSb). In this case n- and
p-type doping can be accomplished in a number of different ways. For example, if a group
II element such as zinc replaces the gallium atoms, the material becomes p-type, whereas
if a group VI element such as tellurium replaces the arsenic atoms, it becomes n-type. Group
IV elements such as germanium and silicon can produce either p- or n-type material
depending on whether the group IV impurity replaces the arsenic or gallium ions respectively.

Similar remarks apply to the group II-VI compound semiconductors, for example CdS
and ZnSe, and to the ternary and quaternary semiconductors such as GaInAs and GaInAsP.



2.4.3 Excitons 



We have just seen that the introduction of suitable impurities into intrinsic semiconductor
material can result in the formation of electron energy levels situated just below the bottom
of the conduction band. However, electron energy levels similarly situated can also appear
in intrinsic material. These arise because the Coulombic attraction of an electron for a hole
can result in the two being bound together; such a bound electron-hole pair is called an
exciton. We may picture the exciton as an electron and a hole orbiting about their common
centre of gravity with orbital radii which are inversely proportional to their effective masses
as shown in Fig. 2.12. The Bohr model is readily adapted to this situation with the electron
mass being replaced by the reduced mass $m_r^*$ of the electron and hole; $m_r^*$ is given by

$$\frac{1}{m_r^*} = \frac{1}{m_e^*} + \frac{1}{m_h^*}$$

FIG. 2.12 Model of an exciton; the electron e and hole h may be regarded as being bound together and
orbiting around their common centre of gravity C with radii which are inversely proportional to their
effective masses (the figure assumes unequal effective masses).

The binding energy $E_e$ is then obtained by modifying eq. (2.28) to give

$$E_e = 13.6\,\frac{m_r^*}{m}\left(\frac{1}{\varepsilon_r}\right)^2\ \mathrm{eV}$$    (2.28a)

Since $m_r^*$ will be the same order of magnitude as $m_e^*$ and $m_h^*$, we see that exciton energy
levels will be similarly placed to those of donor levels in doped semiconductors.

Excitons may move through a crystalline lattice and thus provide an important means of
transferring energy from one point in the material to another. They play an important role
in the luminescence of solids and will be discussed further in Chapter 4. 
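
The reduced-mass form of the hydrogen-like estimate can be sketched in the same way as the
donor calculation. The fragment below combines the electron and hole effective masses into a
reduced mass and then scales the hydrogen ionization energy by the reduced mass and the
square of the relative permittivity. The silicon values used (0.26m, 0.33m and 11.8) are
simply those quoted earlier in this section and serve only as an illustration; the function
names are assumptions.

```python
RYDBERG_EV = 13.6  # ionization energy of the hydrogen atom in eV


def reduced_mass_ratio(m_e_ratio, m_h_ratio):
    """Reduced mass m_r*/m from 1/m_r* = 1/m_e* + 1/m_h*."""
    return 1.0 / (1.0 / m_e_ratio + 1.0 / m_h_ratio)


def exciton_binding_energy(m_e_ratio, m_h_ratio, rel_permittivity):
    """Hydrogen-like exciton binding energy in eV."""
    m_r = reduced_mass_ratio(m_e_ratio, m_h_ratio)
    return RYDBERG_EV * m_r / rel_permittivity**2


m_r = reduced_mass_ratio(0.26, 0.33)
e_x = exciton_binding_energy(0.26, 0.33, 11.8)
print(f"m_r*/m = {m_r:.3f}, binding energy = {e_x:.4f} eV")
```

Because the reduced mass is always smaller than either of the two effective masses, the
estimated exciton binding energy is somewhat smaller than the corresponding donor energy,
but of the same order of magnitude.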



2.5 Carrier concentrations

In calculating semiconductor properties and in analyzing device behaviour it is often
necessary to know the carrier concentrations. In metals we can make a fairly good estimate of
the free electron concentration by calculating the number of atoms per unit volume (from
the density and atomic mass number of the metal) and multiplying by the valency. Similarly
in heavily doped semiconductors we can take the majority carrier concentration to be the
same as the impurity concentration. This may not be the case at high temperatures when the
number of electron-hole pairs generated by electron excitation across the energy gap may
be greater than the number of impurities. The situation in near-intrinsic material is not so
clear, however, nor is the temperature variation of carrier concentrations immediately
obvious.

To calculate the carrier concentration in each energy band we need to know the following
parameters:

1. The distribution of energy states or levels as a function of energy within the energy band.

2. The probability of each of these states being occupied by an electron.

The first of these parameters is given by the density of states function Z(E) which may be
defined as the number of energy states per unit energy per unit volume. The form of Z(E),
which is shown in Fig. 2.13(a), can be derived, for example, from eq. (2.11), which gives
the energy levels in a potential well. It is given by (ref. 2.9)

$$Z(E) = \frac{4\pi}{h^3}(2m_e^*)^{3/2}E^{1/2}$$    (2.29)

where E is measured relative to the bottom of the band.

The second parameter depends on the fact that electrons obey the Pauli exclusion
principle and hence the probability of a particular energy level being occupied at temperature
T is given by Fermi-Dirac statistics. This is in contrast to the case of atoms in an ideal gas
where the Maxwell-Boltzmann distribution function applies (in fact, in several situations
electron behaviour can be approximated by Maxwell-Boltzmann statistics thereby
simplifying the mathematics).

FIG. 2.13 Density of states function (a) and the Fermi-Dirac function at T = 0 K and at T > 0 K (b).

The Fermi-Dirac distribution function is found to be

$$F(E) = \frac{1}{\exp[(E - E_F)/kT] + 1}$$    (2.30)

where $E_F$ is a characteristic energy called the Fermi energy.

The distribution function is shown in Fig. 2.13(b). We notice that at 0 K $F(E)$ is unity
for energies less than $E_F$ and zero for energies greater than $E_F$. At any temperature above
absolute zero the probability of occupation of the energy level at $E = E_F$ is 0.5, as we
can see from eq. (2.30). Figure 2.13(b) also shows that the probability of occupation of
states above $E_F$ is finite for $T > 0$ and that there is a corresponding probability that states
below $E_F$ are empty. In fact, $F(E_F + E) = 1 - F(E_F - E)$. This makes the Fermi level a
natural reference point when calculating electron and hole concentrations. In the case
of intrinsic semiconductors the 'tails' in the probability distribution extend into the
conduction and valence bands respectively as shown in Fig. 2.14(b); the density of states
function and carrier densities are shown in Figs 2.14(a) and (c). As the electron and hole
concentrations are equal we expect the Fermi level to lie near to the middle of the energy
gap.
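
The properties of the Fermi-Dirac function noted above can be verified with a few lines of
code. The sketch below implements eq. (2.30) with energies in electron volts, treats T = 0 as
a step function, and checks the value of 0.5 at the Fermi energy together with the symmetry
about it. The function name, the choice of 0.5 at exactly E = E_F when T = 0, and the sample
Fermi energy of 0.5 eV are assumptions made for illustration.

```python
import math

K_B_EV = 8.617333e-5  # Boltzmann constant in eV per kelvin


def fermi_dirac(energy_ev, fermi_ev, temperature_k):
    """Occupation probability F(E) of eq. (2.30); energies in eV."""
    if temperature_k <= 0.0:
        # At absolute zero the function is a step at the Fermi energy.
        if energy_ev < fermi_ev:
            return 1.0
        if energy_ev > fermi_ev:
            return 0.0
        return 0.5  # convention chosen here for E exactly equal to E_F
    x = (energy_ev - fermi_ev) / (K_B_EV * temperature_k)
    if x > 700.0:
        return 0.0  # avoids overflow in exp for states far above E_F
    return 1.0 / (math.exp(x) + 1.0)


E_F = 0.5  # hypothetical Fermi energy in eV
for delta in (-0.1, -0.026, 0.0, 0.026, 0.1):
    print(f"E - E_F = {delta:+.3f} eV: F = {fermi_dirac(E_F + delta, E_F, 300.0):.4f}")
```

The printed values show how quickly the occupation probability falls from nearly unity to
nearly zero within a few kT either side of the Fermi energy at room temperature.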

In n-type material there are many more electrons in the conduction band than holes in the 
valence band and we expect the Fermi level to lie near the donor levels. Similarly in p-type 
material the Fermi level lies near the acceptor levels. The Fermi level, density of states and 
carrier densities in n-type material are shown in Fig. 2.15. In many cases, for example when
considering junctions between dissimilar materials, the energies of the impurity levels can
be ignored if we know where the Fermi level lies.

FIG. 2.14 Graphical representation of (a) the density of states, (b) the Fermi-Dirac distribution and (c)
carrier densities for an intrinsic semiconductor.

FIG. 2.15 Graphical representation of (a) the density of states, (b) the Fermi-Dirac distribution and (c)
carrier densities for an n-type semiconductor.

Returning to the calculation of carrier concentrations we see that this is given by summing
the product of the density of states and the occupancy probability over the energy range of
interest; that is,

$$n = \int_{\text{band}} F(E)\,Z(E)\,dE$$    (2.31)

Taking first the case of a metal at absolute zero, where $F(E) = 1$ and the upper occupied level
is $E_F$, we have

$$n = \int_0^{E_F}\frac{4\pi}{h^3}(2m_e^*)^{3/2}E^{1/2}\,dE$$

and hence

$$n = \frac{8\pi}{3h^3}(2m_e^*E_F)^{3/2}$$    (2.32)

In fact we often calculate n as described earlier and use eq. (2.32) to calculate $E_F$; typical
values are 7.0 eV for copper and 11.2 eV for aluminium, which are in good agreement with
experimental results.
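
Equation (2.32) can be inverted to give the Fermi energy from the electron density. The
sketch below does this for copper, using the electron density of 8.4 x 10^28 m^-3 obtained in
Example 2.1 and assuming, as there, that the effective mass equals the free electron mass. The
function names and rounded constants are illustrative choices.

```python
import math

H_PLANCK = 6.626e-34    # Planck constant in J s
M_ELECTRON = 9.109e-31  # free electron mass in kg
E_CHARGE = 1.602e-19    # joules per electron volt


def fermi_energy_ev(n, m_eff=M_ELECTRON):
    """Invert eq. (2.32): E_F = (h^2 / 2m*) (3n / 8 pi)^(2/3), returned in eV."""
    e_f_joules = (H_PLANCK**2 / (2.0 * m_eff)) * (3.0 * n / (8.0 * math.pi)) ** (2.0 / 3.0)
    return e_f_joules / E_CHARGE


def electron_density(fermi_ev, m_eff=M_ELECTRON):
    """Eq. (2.32): n = (8 pi / 3 h^3) (2 m* E_F)^(3/2), with E_F given in eV."""
    return (8.0 * math.pi / (3.0 * H_PLANCK**3)) * (2.0 * m_eff * fermi_ev * E_CHARGE) ** 1.5


e_f_cu = fermi_energy_ev(8.4e28)
print(f"E_F for copper = {e_f_cu:.2f} eV")
```

The result is close to the 7.0 eV quoted above for copper.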

For the case of the electron concentration in the conduction band of a semiconductor we
replace E in eq. (2.29) by $(E - E_c)$ so that eq. (2.31) becomes

$$n = \frac{4\pi}{h^3}(2m_e^*)^{3/2}\int_{E_c}^{E_T}\frac{(E - E_c)^{1/2}}{\exp[(E - E_F)/kT] + 1}\,dE$$

where $E_T$ is the top of the band. Making the substitutions and approximations suggested in
Problem 2.14 we have

$$n = N_c\exp\left[-\frac{(E_c - E_F)}{kT}\right]$$    (2.33)

where

$$N_c = 2\left(\frac{2\pi m_e^*kT}{h^2}\right)^{3/2}$$

$N_c$ is called the effective density of states in the conduction band; it is constant at a constant
temperature.
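
The effective density of states is straightforward to evaluate. The sketch below implements
the expression for N_c given with eq. (2.33) and applies it to the case treated in Example
2.3 (an effective mass of 0.55m at 300 K). It also evaluates eq. (2.33) itself for a Fermi
level placed 0.3 eV below the conduction band edge; that separation is a hypothetical value
chosen only to show the calculation, and the function names are likewise assumptions.

```python
import math

H_PLANCK = 6.626e-34    # Planck constant in J s
K_B = 1.381e-23         # Boltzmann constant in J/K
K_B_EV = 8.617e-5       # Boltzmann constant in eV/K
M_ELECTRON = 9.109e-31  # free electron mass in kg


def effective_density_of_states(m_eff_ratio, temperature_k):
    """N = 2 (2 pi m* k T / h^2)^(3/2) in m^-3, with m* given in units of m."""
    m_eff = m_eff_ratio * M_ELECTRON
    return 2.0 * (2.0 * math.pi * m_eff * K_B * temperature_k / H_PLANCK**2) ** 1.5


def electron_concentration(n_c, ec_minus_ef_ev, temperature_k):
    """Eq. (2.33): n = N_c exp[-(E_c - E_F)/kT], energies in eV."""
    return n_c * math.exp(-ec_minus_ef_ev / (K_B_EV * temperature_k))


n_c = effective_density_of_states(0.55, 300.0)
print(f"N_c = {n_c:.2e} m^-3")
print(f"n   = {electron_concentration(n_c, 0.3, 300.0):.2e} m^-3")
```

Since N_c varies only as the 3/2 power of temperature, the exponential factor in eq. (2.33)
dominates the temperature dependence of n.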



EXAMPLE 2.3 Effective density of states in the conduction band

We may calculate the effective density of states for germanium at 300 K given that
the appropriate effective mass is 0.55m.

From eq. (2.33) we have $N_c = 1.03 \times 10^{25}$ m$^{-3}$.

By a similar argument, the concentration of holes in the valence band is given by

$$p = \int_{\text{band}}[1 - F(E)]\,Z(E)\,dE$$

or

$$p = N_v\exp\left[-\frac{(E_F - E_v)}{kT}\right]$$    (2.34)

where

$$N_v = 2\left(\frac{2\pi m_h^*kT}{h^2}\right)^{3/2}$$
The carrier concentrations predicted by eqs (2.33) and (2.34) are valid for any type of semiconductor provided the appropriate Fermi level is used, and Fig. 2.15 shows the carrier distributions, together with the functions $Z(E)$ and $F(E)$, for an n-type semiconductor. We may use these equations to derive an expression for $E_F$ in intrinsic semiconductors where $n = p = n_i$. It is left to the reader to show that, with energies measured from the top of the valence band,

$$E_{Fi} = \frac{E_g}{2} + \frac{3}{4}kT\ln\left(\frac{m_h^*}{m_e^*}\right) \tag{2.35}$$

The second term on the right-hand side of eq. (2.35) is usually very small and, as we supposed above, $E_{Fi} \approx \tfrac{1}{2}E_g$.

We also note from eqs (2.33) and (2.34) that the product of $n$ and $p$ is constant for a given piece of semiconductor at constant temperature. That is,

$$np = n_ip_i = n_i^2 = N_cN_v\exp(-E_g/kT) \tag{2.36}$$

Equation (2.36) is a very important relationship, for once we know $n$ or $p$ then the other can be determined from $n_i^2$; it is often referred to as the 'law of mass action' for semiconductors.
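
The independence of the product $np$ from the position of the Fermi level can be checked numerically. The sketch below evaluates eqs (2.33) and (2.34) for several Fermi levels; the values of $N_c$, $N_v$ and the band gap are illustrative assumptions, and $kT$ is taken as 0.025 eV.

```python
import math

KT_EV = 0.025   # kT near room temperature, in eV


def electron_concentration(n_c, e_c, e_f):
    """Eq. (2.33): n = N_c exp[-(E_c - E_F)/kT], energies in eV."""
    return n_c * math.exp(-(e_c - e_f) / KT_EV)


def hole_concentration(n_v, e_v, e_f):
    """Eq. (2.34): p = N_v exp[-(E_F - E_v)/kT], energies in eV."""
    return n_v * math.exp(-(e_f - e_v) / KT_EV)


# Illustrative (assumed) parameters, not taken from the text.
N_C, N_V = 1.0e25, 5.0e24   # effective densities of states, m^-3
E_V, E_C = 0.0, 0.7         # band edges in eV, so E_g = 0.7 eV

n_i_squared = N_C * N_V * math.exp(-(E_C - E_V) / KT_EV)   # eq. (2.36)
products = [
    electron_concentration(N_C, E_C, e_f) * hole_concentration(N_V, E_V, e_f)
    for e_f in (0.2, 0.35, 0.5)
]
for product in products:
    print(f"np = {product:.4e} m^-6, n_i^2 = {n_i_squared:.4e} m^-6")
```

Moving the Fermi level towards the conduction band raises $n$ and lowers $p$ by the same factor, so every product agrees with $n_i^2$.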

Equation (2.36) shows that the temperature variation of $n_i$ is essentially exponential, that is $n_i \propto \exp(-E_g/2kT)$. Similarly, for an n-type semiconductor the excitation of electrons from the donor levels to the conduction band is governed by the exponential function $\exp(-E_D/kT)$. As $E_D$ is rather small, we find that at temperatures above about 100 K nearly all the donors are ionized, while at such temperatures the number of electron-hole pairs formed by intrinsic excitation is negligible. This situation continues until the temperature rises somewhat above room temperature in the case of silicon. Thus the variation of electron concentration with temperature has the form shown in Fig. 2.16, the exact details of this variation depending on the doping level.

Finally if we denote the number of electrons having energies between $E$ and $E + dE$ in the conduction band of a semiconductor as $n(E)\,dE$ we may write

$$n(E) = \frac{4\pi}{h^3}(2m_e^*)^{3/2}(E - E_c)^{1/2}\exp\left[-\frac{E - E_F}{kT}\right]$$



FIG. 2.16 Electron density as a function of temperature in n-type silicon; $N_d$ is the density of donor impurity atoms.

This function is shown in Fig. 2.14(c) for intrinsic semiconductors, from which it can be seen that $n(E)$ rises fairly rapidly to a maximum and then slowly decreases. It can be shown that the maximum value of $n(E)$ occurs at an energy of $E_c + kT/2$ and that $n(E)$ falls to one-half of this value over an energy range of $1.8kT$ (see Problem 2.20). Thus we see that the electrons within the conduction band are predominantly distributed over an energy range of about $2kT$. Similar comments apply to the distribution of holes in the valence band. We shall return to this point in section 2.9.
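
The peak position and half-height width quoted above can be checked numerically. In the sketch below the energy is written in the reduced form $x = (E - E_c)/kT$, so that the energy dependence of $n(E)$ is $x^{1/2}\exp(-x)$; the grid search and bisection helper are illustrative choices, not part of the text.

```python
import math


def shape(x):
    """Energy dependence of n(E) with x = (E - E_c)/kT."""
    return math.sqrt(x) * math.exp(-x)


def bisect(func, lo, hi, tol=1e-10):
    """Find a root of func in [lo, hi]; func must change sign on the interval."""
    f_lo = func(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = func(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Locate the maximum on a fine grid of reduced energies.
grid = [i * 1e-4 for i in range(1, 30000)]
x_peak = max(grid, key=shape)

# Find the two energies at which n(E) has fallen to half its peak value.
half = 0.5 * shape(0.5)
x_low = bisect(lambda x: shape(x) - half, 1e-9, 0.5)
x_high = bisect(lambda x: shape(x) - half, 0.5, 10.0)
width = x_high - x_low

print(f"peak at {x_peak:.3f} kT above E_c, half-height width {width:.2f} kT")
```

The asymmetry of the two half-height points about the peak reflects the slow fall of the Boltzmann tail compared with the steep rise of the density of states near $E_c$.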



2.6 Work function



We saw in section 1.4 that when light of an appropriate frequency falls onto metals (and 
indeed onto other solids) electrons may be emitted. Similarly, when solids are heated 
thermionic emission of electrons may occur. The minimum energy required to enable an electron to escape from the surface of the solid, in either case, is called the work function $\phi$.

A simple model of this situation in the case of a metal is shown in Fig. 2.17(a), which represents an electron trapped in a well of finite depth (i.e. one from which it may escape). We see that $\phi$ is the energy difference between the Fermi level (which corresponds to the
highest occupied energy level) and the vacuum level. We consider that when the electron 
reaches an energy equivalent to the vacuum level it has escaped from the metal. We might 
compare this situation with the ionization of an atom. 

In semiconductors the work function is also defined as the energy difference between the Fermi and vacuum levels (Fig. 2.17b), but in this case it is more usual to use the electron affinity $\chi$, defined as the energy difference between the bottom of the conduction band and the vacuum level. In some respects $\chi$ has more physical significance than $\phi$ as, in general, there are no electrons at the Fermi level in semiconductors.

FIG. 2.17 Diagram showing the work function of (a) a metal and (b) an n-type semiconductor.



2.7 Excess carriers in semiconductors



Most semiconductor devices operate by the creation of charge carrier concentrations in excess 
of the thermal equilibrium values given by eqs (2.33) and (2.34). These excess concentra- 
tions may be created by optical excitation or carrier injection via a suitable contact. In optical 
excitation electron-hole pairs are generated when the energy of the incident photons is sufficient to excite an electron from the valence to the conduction band, that is when $h\nu > E_g$.
Since additional carriers are thereby created the electrical conductivity of the material will 
increase. This is the basis of the photoconductive devices described in section 7.3.5. 

If a carrier excess is created in this way and the exciting radiation is then suddenly cut 
off, the carrier concentrations will return gradually to their thermal equilibrium values owing 
to recombination of the excess carriers. During this time, the recombination rate, being pro- 
portional to the increased carrier concentrations, will be higher than the thermal generation 
rate, assuming that the temperature remains constant. 

If we suppose that $\Delta n(t)$ and $\Delta p(t)$ are the excess carrier concentrations at any given instant ($\Delta n(t) = \Delta p(t)$), the net rate at which the excess electron-hole pairs disappear is, from eq. (2.27),

$$-\frac{dn(t)}{dt} = -\frac{d\Delta n(t)}{dt} = B[n + \Delta n(t)][p + \Delta p(t)] - Bn_i^2 \tag{2.37}$$

where the total concentration of electrons at time $t$ is $n(t) = n + \Delta n(t)$, $n$ being the equilibrium concentration (similarly for holes). As $np = n_i^2$ and $\Delta n(t) = \Delta p(t)$, we can rewrite eq. (2.37) as

$$-\frac{d\Delta n(t)}{dt} = B[(n + p)\Delta n(t) + (\Delta n(t))^2]$$

If the excess concentrations are not too high (i.e. we have low level injection) we can ignore the term $(\Delta n(t))^2$. Furthermore, if the material is extrinsic we can usually neglect the minority carrier concentration. Thus, if the material is p-type we have $p \gg n$, and hence we may write

$$-\frac{d\Delta n(t)}{dt} = Bp\,\Delta n(t)$$

The solution to this equation is

$$\Delta n(t) = \Delta n(0)\exp(-Bpt) = \Delta n(0)\exp(-t/\tau_e) \tag{2.38}$$

where $\Delta n(0)$ is the excess carrier concentration at $t = 0$ when the exciting source is switched off and the decay constant, $\tau_e = 1/Bp$, is called the minority carrier recombination lifetime (minority, because the calculation is made in terms of the minority carriers). Similarly, the minority carrier lifetime of holes in n-type material is $\tau_h = (Bn)^{-1}$. Physically $\tau$ represents the average time an excess carrier remains free before recombining directly. The carrier lifetime for indirect recombination is more complicated than in the above case since it is necessary to include the time required to capture each type of carrier.



EXAMPLE 2.4 Minority carrier lifetimes 



We may calculate the minority carrier recombination lifetime for direct recombination in gallium arsenide and silicon for a donor concentration of $5 \times 10^{24}$ m$^{-3}$ using the data provided in Appendix 6.

We see above that the hole lifetime in n-type material is given by $\tau_h = 1/Bn$. Hence, for GaAs,

$$\tau_h = \frac{1}{7.2 \times 10^{-16} \times 5 \times 10^{24}} = 278\ \text{ps}$$

and for Si,

$$\tau_h = \frac{1}{1.8 \times 10^{-21} \times 5 \times 10^{24}} = 111\ \mu\text{s}$$

We thus see that direct, radiative recombination is much more likely in gallium arsenide than in silicon.
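
The arithmetic of Example 2.4 is easily reproduced. The sketch below uses the recombination coefficients quoted in the example (in m$^3$ s$^{-1}$); the function and variable names are illustrative.

```python
def minority_lifetime(b_coeff, majority_concentration):
    """Direct-recombination lifetime tau = 1/(B * n) from eq. (2.38)."""
    return 1.0 / (b_coeff * majority_concentration)


# Recombination coefficients B as used in Example 2.4, in m^3 s^-1.
B_COEFF = {"GaAs": 7.2e-16, "Si": 1.8e-21}
N_DONOR = 5e24   # donor concentration, m^-3

lifetimes = {name: minority_lifetime(b, N_DONOR) for name, b in B_COEFF.items()}
for name, tau in lifetimes.items():
    print(f"{name}: tau_h = {tau:.3e} s")
```

The ratio of the two lifetimes is simply the inverse ratio of the $B$ coefficients, since the doping level is the same in both cases.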



2.7.1 Diffusion of carriers 

Let us suppose that a concentration gradient of excess minority carriers is created in a rod 
of n-type semiconductor by injecting holes into one end of the rod via a suitable contact, as 
shown in Fig. 2.18. Owing to the random thermal motion of the holes at any given location 
it is probable that more holes will move to the right than to the left in a given time interval, 
and there will be a net movement or diffusion of holes along the rod down the concentra- 
tion gradient. The net rate of flow of holes across unit area due to diffusion is found to be 
proportional to the concentration gradient, that is 

$$\text{hole flux} = -D_h\frac{dp(x)}{dx} \tag{2.39}$$

FIG. 2.18 Minority carrier injection and diffusion.

Similarly for electrons we have

$$\text{electron flux} = -D_e\frac{dn(x)}{dx} \tag{2.39a}$$

$D_h$ and $D_e$ are the hole and electron diffusion coefficients. These parameters are related to the mobilities in that they are a measure of the ease of carrier motion through the crystal lattice. In fact the so-called Einstein relationships give the diffusion coefficients in terms of the mobilities as (ref. 2.10)

$$D_{e,h} = \frac{kT}{e}\mu_{e,h} \tag{2.40}$$



As the holes diffuse along the rod they will eventually recombine. We now show that before recombining, the holes travel a characteristic distance called the diffusion length $L_h$. Let us consider a length $\Delta x$ of the rod situated a distance $x$ from the injecting contact (as shown in Fig. 2.18); then if the cross-sectional area of the rod is $A$ the flow rate of holes into the left-hand face of the element is

$$-D_h\left[\frac{d}{dx}\Delta p(x)\right]A$$

Similarly, the flow rate out of the right-hand face is

$$-D_h\left[\frac{d}{dx}\Delta p(x) + \frac{d}{dx}\left(\frac{d}{dx}\Delta p(x)\right)\Delta x\right]A$$

The difference of these two terms is the net rate at which the element gains holes, which in the steady state must equal the rate of recombination of excess holes; therefore,

$$D_h\frac{d^2(\Delta p(x))}{dx^2}\,\Delta x\,A = \frac{\Delta p(x)\,\Delta x\,A}{\tau_h} \tag{2.41}$$

or

$$\frac{d^2\Delta p(x)}{dx^2} = \frac{\Delta p(x)}{\tau_hD_h}$$

The solution to this equation, which is often called the steady state diffusion equation, is

$$\Delta p(x) = B_1\exp(x/L_h) + B_2\exp(-x/L_h)$$

where $L_h = \sqrt{D_h\tau_h}$ is the hole diffusion length. The constants $B_1$ and $B_2$ are determined by the boundary conditions, which in the case we are describing we can express as

$\Delta p(x) \to 0$ as $x \to \infty$, which implies that $B_1$ must be zero; and
$\Delta p(x) = \Delta p(0)$ at $x = 0$, so that $B_2 = \Delta p(0)$.

Hence we have

$$\Delta p(x) = \Delta p(0)\exp(-x/L_h) \tag{2.42}$$

Thus we see that the excess minority carrier concentration decreases exponentially with distance; we shall return to this relationship in section 2.8.2.
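
To give a feel for the magnitudes involved, the sketch below evaluates the diffusion length and the decay profile of eq. (2.42). The diffusion coefficient is the silicon value used later in Example 2.6; the lifetime of $10^{-4}$ s is an assumed, illustrative figure.

```python
import math


def diffusion_length(diffusion_coeff, lifetime):
    """Diffusion length L = sqrt(D * tau)."""
    return math.sqrt(diffusion_coeff * lifetime)


def excess_holes(x, excess_at_origin, length):
    """Eq. (2.42): excess hole concentration a distance x from the contact."""
    return excess_at_origin * math.exp(-x / length)


D_H = 1.2e-3     # hole diffusion coefficient for silicon, m^2 s^-1 (Example 2.6)
TAU_H = 1.0e-4   # assumed hole lifetime, s (illustrative value)

L_H = diffusion_length(D_H, TAU_H)
print(f"L_h = {L_H:.3e} m")
for multiple in (0, 1, 2, 3):
    fraction = excess_holes(multiple * L_H, 1.0, L_H)
    print(f"x = {multiple} L_h: fraction of injected excess remaining = {fraction:.3f}")
```

After one diffusion length the excess concentration has fallen to $1/e$ of its value at the contact, which is what makes $L_h$ a natural length scale for injected carriers.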

2.7.2 Diffusion and drift of carriers 

The diffusion of charge carriers will obviously give rise to a current flow. From eqs (2.39) 
we can write the electron and hole diffusion current densities as 

$$J_e(\text{diff}) = eD_e\frac{dn}{dx} \tag{2.43}$$

$$J_h(\text{diff}) = -eD_h\frac{dp}{dx} \tag{2.43a}$$

If an electric field is present in addition to a concentration gradient, we can use eqs (2.24) and (2.43) to write the current densities for electrons and holes as

$$J_e = e\mu_en\mathcal{E} + eD_e\frac{dn}{dx}$$

$$J_h = e\mu_hp\mathcal{E} - eD_h\frac{dp}{dx}$$

and the total current density is the sum of these contributions, that is

$$J = J_e + J_h$$



2.8 Junctions



The majority of the most useful electronic devices contain junctions between dissimilar materials, which may be metal-metal, metal-semiconductor or semiconductor-semiconductor combinations. We shall concentrate our attention initially on p-n homojunctions, in which a junction is formed between p and n variants of the same semiconductor. Such junctions, in addition to their rectifying properties, form the basis of photodiodes, light-emitting diodes and photovoltaic devices, which will be discussed in later chapters. Many other devices, for example bipolar transistors and thyristors, contain two or more such junctions.

2.8.1 The p-n junction in equilibrium 

A p-n junction may be fabricated as a single crystal of semiconductor by a number of different techniques (ref. 2.11). Indeed the exact behaviour of a junction depends to a large extent on the fabrication process used, which in turn determines the distances over which the change from p- to n-type nature occurs. For mathematical convenience we shall assume that the junction is abrupt, that is there is a step change in impurity type as shown in Fig. 2.19, which also shows the corresponding carrier concentrations $p_p$, $n_p$ and $n_n$, $p_n$ on the p and n sides of the junction respectively. These apply only at relatively large distances from the junction; close to the junction they are modified as we shall see below.

FIG. 2.19 Abrupt p-n junction showing (a) the impurity concentrations and (b) the carrier densities on the two sides of the junction.

FIG. 2.20 Schematic representation of the formation of a p-n junction: (a) initially separated p-type and n-type materials; (b) the energy band distribution after the junction is formed; (c) the space charge layers of ionized impurity atoms within the depletion region $W$; and (d) the potential distribution at the junction.

While the assumption of a step junction may be used to establish many of the character- 
istics of junctions, it is not universally applicable. For example, in many cases the junction 
approximates to a linearly graded one in which there is a gradual change in doping type. 
One or two of the ways in which such junctions differ from abrupt ones will be pointed out 
in due course. 

Although not the case in practice, let us assume that the junction is formed by bringing 
initially isolated pieces of n-type and p-type materials into intimate contact. Then, since there 
are many more holes in the p-type than in the n-type material, holes will diffuse from the p 
to the n region. The holes diffusing out of the p-type side leave behind 'uncovered' or ionized 
acceptors, thereby building up a negative space charge layer in the p-type side close to the 
junction. Similarly electrons, diffusing into the p-type side, leave behind a positive space 
charge layer of ionized donors as shown in Fig. 2.20(c). This double space charge layer causes an electric field to be set up across a narrow region on either side of the junction, directed from the n- to the p-type region as shown.

The direction of the junction electric field is such as to inhibit further diffusion of the majority carriers, though such diffusion is not prevented altogether. This must be so since the electric field will sweep minority carriers across the junction so that there is a drift current of electrons from the p- to the n-type side, and of holes from the n- to the p-type side, which is in the opposite direction to the diffusion current. The junction field thus builds up until the drift and diffusion current flows are equal in magnitude, at which stage the Fermi level is constant across the junction as shown in Fig. 2.20(b) indicating that equilibrium has been reached within the crystal as a whole. Thus, as there is no net current flow in equilibrium,

$$J_h(\text{drift}) + J_h(\text{diff}) = 0$$

and

$$J_e(\text{drift}) + J_e(\text{diff}) = 0$$

The induced electric field establishes a contact or diffusion potential $V_0$ between the two regions and the energy bands of the p-type side are displaced relative to those of the n-type as shown in Fig. 2.20(b). The magnitude of the contact potential depends on the temperature and the doping levels as we shall see below. The contact potential is established across the space charge region, which is also referred to as the transition or depletion region, so called because this region has been almost depleted of its majority carriers. As a consequence it is very resistive relative to the other (so-called bulk) regions of the device.

An expression relating the contact potential to the doping levels can be obtained, for example, from eq. (2.33). From this equation and adopting the notation used in Figs 2.19 and 2.20 we can write the electron concentration in the conduction band of the p-type side as

$$n_p = N_c\exp\left[-\frac{E_{cp} - E_{Fp}}{kT}\right]$$

Similarly the electron concentration in the n-type side is

$$n_n = N_c\exp\left[-\frac{E_{cn} - E_{Fn}}{kT}\right]$$

As we mentioned above, the Fermi level is constant everywhere in equilibrium so that $E_{Fp} = E_{Fn} = E_F$ say. Hence, eliminating $N_c$ we have

$$E_{cp} - E_{cn} = kT\ln\left(\frac{n_n}{n_p}\right) = eV_0$$

Therefore

$$V_0 = \frac{kT}{e}\ln\left(\frac{n_n}{n_p}\right) \tag{2.44}$$

At temperatures in the range 100 K < $T$ < 400 K the majority carrier concentrations are equal to the doping levels, that is $n_n = N_d$ and $p_p = N_a$, and remembering $np = n_i^2$ we can write eq. (2.44) as

$$V_0 = \frac{kT}{e}\ln\left(\frac{N_aN_d}{n_i^2}\right) \tag{2.45}$$



EXAMPLE 2.5 Equilibrium p-n junction contact potential difference 

The contact potential difference $V_0$ in a given germanium diode may be calculated from the following data: donor impurity level $N_d = 10^{22}$ m$^{-3}$; acceptor impurity level $N_a = 10^{24}$ m$^{-3}$; and intrinsic electron concentration $n_i = 2.4 \times 10^{19}$ m$^{-3}$.

Then assuming room temperature ($T \approx 290$ K) we have, from eq. (2.45), that

$$V_0 = 0.025\ln\left(\frac{10^{22} \times 10^{24}}{(2.4 \times 10^{19})^2}\right) = 0.42\ \text{V}$$
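
Example 2.5 can be reproduced in a few lines. The sketch below implements eq. (2.45) with $kT/e$ taken as 0.025 V, as in the example; the function name is an illustrative choice.

```python
import math


def contact_potential(n_donor, n_acceptor, n_intrinsic, kt_over_e=0.025):
    """Eq. (2.45): V_0 = (kT/e) ln(N_a N_d / n_i^2), in volts."""
    return kt_over_e * math.log(n_donor * n_acceptor / n_intrinsic ** 2)


# Germanium diode data from Example 2.5 (all concentrations in m^-3).
v_0 = contact_potential(n_donor=1e22, n_acceptor=1e24, n_intrinsic=2.4e19)
print(f"V_0 = {v_0:.3f} V")
```

Because the doping levels enter only through a logarithm, a tenfold increase in either of them raises $V_0$ by only $(kT/e)\ln 10$, or roughly 0.06 V at room temperature.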



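Equation (2.44) gives us a very useful relationship between the carrier concentrations on the two sides of the junction, that is

$$n_p = n_n\exp\left(-\frac{eV_0}{kT}\right) \tag{2.46}$$

and similarly

$$p_n = p_p\exp\left(-\frac{eV_0}{kT}\right) \tag{2.46a}$$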

2.8.2 Current flow in a forward-biased p-n junction 

If the equilibrium situation is disturbed by connecting a voltage source externally across the 
junction there will be a net current flow. The junction is said to be forward biased if the p
region is connected to the positive terminal of the voltage source as shown in Fig. 2.21(a).
As we mentioned in the last section, the depletion region is very resistive in comparison with
the bulk regions so that the external voltage $V$ is dropped almost entirely across the depletion region. This has the effect of lowering the height of the potential barrier to $(V_0 - V)$ as
shown in Fig. 2.21(b). Consequently, majority carriers are able to surmount the potential 
barrier much more easily than in the equilibrium case so that the diffusion current becomes 
much larger than the drift current. There is now a net current from the p to the n region in 
the conventional forward sense and carriers flow in from the external circuit to restore equi- 
librium in the bulk regions. We note that with the application of an external potential the 
Fermi levels are no longer aligned across the junction. 

The reduction in height of the potential barrier leads to majority carriers being injected across the junction. On being so injected, these carriers immediately become minority carriers, and the minority carrier concentrations near to the junction rise to the new values $n_p'$ and $p_n'$. This establishes excess minority carrier concentration gradients, as shown in Fig. 2.22, so that the injected carriers diffuse away from the junction. This situation is precisely the same as that described in section 2.7.1 and thus, considering the n region, the injected holes diffuse away from the junction recombining as they do so. The electrons lost in this recombination are replaced by the external voltage source so that a current flows in the external circuit. A similar argument applies to the p region, with the roles of electrons and holes reversed. It should be noted that the majority carrier concentrations are not noticeably changed as a consequence of the injection (Fig. 2.22) unless the bias voltage is almost equal to $V_0$, resulting in a very large current flow.

FIG. 2.21 Forward bias voltage $V$ applied to a p-n junction (a) and the resulting energy band structure (b).

The drift current is relatively insensitive to the height of the potential barrier since all the 
minority carriers generated within about a diffusion length of the edge of the depletion region 
may diffuse to the depletion region and be swept across it, whatever the size of the electric 
field there. 

FIG. 2.22 Carrier densities in the bulk region of a forward-biased p-n junction diode. Owing to carrier injection, the minority carrier densities close to the depletion layer are greater than the equilibrium values.

By the same argument that was used to give eqs (2.46), we see that with forward bias the minority carrier concentrations in the bulk regions adjacent to the depletion layer become

$$n_p' = n_n\exp\left[-\frac{e(V_0 - V)}{kT}\right] \tag{2.47}$$

and

$$p_n' = p_p\exp\left[-\frac{e(V_0 - V)}{kT}\right] \tag{2.47a}$$

Then taking eqs (2.46a) and (2.47a), for example, we have

$$p_n' = p_n\exp\left(\frac{eV}{kT}\right) \tag{2.48}$$



As we noted above, the excess minority carrier concentration will decrease owing to recombination in accordance with eq. (2.42) so that we may write the excess hole concentration in the n region at a distance $x$ from the edge of the depletion layer as

$$\Delta p(x) = \Delta p(0)\exp(-x/L_h)$$

where now

$$\Delta p(x) = p_n'(x) - p_n$$

and $\Delta p(0)$ is the value of $\Delta p(x)$ at $x = 0$, that is $(p_n' - p_n)$. Using eq. (2.48) we can therefore write

$$\Delta p(0) = p_n[\exp(eV/kT) - 1] \tag{2.49}$$



We have assumed a one-dimensional carrier flow in the $x$ direction, which is an acceptable approximation even though in practice carrier flow occurs in three dimensions.

Now, we have argued that the electric fields in the bulk regions are very small and therefore, adjacent to the depletion layer in the n region, and in particular at $x = 0$, the total current density will be due to diffusion only. Thus the current density due to hole motion is given by eq. (2.43a). Hence differentiating eq. (2.42) and substituting into eq. (2.43a) gives

$$J_h = \frac{eD_h}{L_h}\Delta p(0)\exp(-x/L_h)$$

At $x = 0$, we can write this, using eq. (2.49), as

$$J_h = \frac{eD_h}{L_h}p_n\left[\exp\left(\frac{eV}{kT}\right) - 1\right] \tag{2.50}$$

There is a similar contribution due to electron flow, that is

$$J_e = \frac{eD_e}{L_e}n_p\left[\exp\left(\frac{eV}{kT}\right) - 1\right] \tag{2.50a}$$

The total current density is therefore given by

$$J = J_0\left[\exp\left(\frac{eV}{kT}\right) - 1\right] \tag{2.51}$$

where

$$J_0 = e\left(\frac{D_h}{L_h}p_n + \frac{D_e}{L_e}n_p\right) \tag{2.51a}$$

$J_0$ is called the saturation current density; it represents the current flow when a few volts of reverse bias is applied to the junction (see section 2.8.3).



EXAMPLE 2.6 Saturation current density 



We may calculate the saturation current density in an abrupt silicon junction given the following data: $N_d = 10^{21}$ m$^{-3}$; $N_a = 10^{22}$ m$^{-3}$; $D_e = 3.4 \times 10^{-3}$ m$^2$ s$^{-1}$; $D_h = 1.2 \times 10^{-3}$ m$^2$ s$^{-1}$; $L_e = 7.1 \times 10^{-4}$ m; $L_h = 3.5 \times 10^{-4}$ m; and $n_i = 1.6 \times 10^{16}$ m$^{-3}$.

Assuming that all of the impurities are ionized, we have $n_n = N_d = 10^{21}$ m$^{-3}$ and therefore

$$p_n = \frac{n_i^2}{n_n} = \frac{2.56 \times 10^{32}}{10^{21}} = 2.56 \times 10^{11}\ \text{m}^{-3}$$

Similarly $n_p = 2.56 \times 10^{10}$ m$^{-3}$. Therefore from eq. (2.51a) $J_0 = 1.6 \times 10^{-7}$ A m$^{-2}$.

A typical discrete diode may have a junction area of about $10^{-6}$ m$^2$ and hence the reverse bias saturation current would be $I_0 = 1.6 \times 10^{-13}$ A.
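
The steps of Example 2.6 can be sketched in code as follows. The doping levels are read here as $N_d = 10^{21}$ m$^{-3}$ and $N_a = 10^{22}$ m$^{-3}$, which is the combination that reproduces the quoted result; the function name is illustrative.

```python
E_CHARGE = 1.602e-19   # electronic charge, C


def saturation_current_density(n_i, n_d, n_a, d_e, d_h, l_e, l_h):
    """Eq. (2.51a): J_0 = e (D_h p_n / L_h + D_e n_p / L_e), in A m^-2."""
    p_n = n_i ** 2 / n_d   # minority holes on the n side (law of mass action)
    n_p = n_i ** 2 / n_a   # minority electrons on the p side
    return E_CHARGE * (d_h * p_n / l_h + d_e * n_p / l_e)


j_0 = saturation_current_density(
    n_i=1.6e16, n_d=1e21, n_a=1e22,
    d_e=3.4e-3, d_h=1.2e-3, l_e=7.1e-4, l_h=3.5e-4,
)
i_0 = j_0 * 1e-6   # junction area of 1e-6 m^2, as in the example
print(f"J_0 = {j_0:.2e} A m^-2, I_0 = {i_0:.2e} A")
```

The hole term dominates here because the n side is the more lightly doped, so its minority carrier concentration is the larger of the two.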



2.8.3 Current flow in a reverse-biased p-n junction 

FIG. 2.23 Reverse bias voltage $V$ applied to a p-n junction (a) and the resulting energy band structure (b).

FIG. 2.24 Carrier densities in the bulk regions of a reverse-biased p-n junction diode. Owing to carrier extraction the minority carrier densities close to the depletion layer are less than the equilibrium values.

FIG. 2.25 Current-voltage characteristics of a p-n junction diode. The reverse saturation current $i_0$ is equal to $J_0$ multiplied by the junction cross-sectional area.

In this case the external bias is applied so that the p region is connected to the negative terminal of the voltage source as shown in Fig. 2.23(a). This has the effect of increasing the height of the potential barrier to $(V_0 + V)$ (Fig. 2.23b), thereby reducing the diffusion current to negligible proportions. The net current flow is therefore the drift current which is directed in the conventional reverse sense, that is from the n to the p region. This results in carrier extraction rather than injection because the minority carriers generated near the junction diffuse to it and are swept across the depletion region. The nearer the carriers are generated to the junction the greater is the probability of this occurring so that a concentration gradient is formed towards the junction, as illustrated in Fig. 2.24. The drift of carriers across the junction is therefore 'fed' by diffusion so that we may use precisely the same arguments to derive the current-voltage relationship as we used in the previous section. Equations (2.51) also apply in this case, the only difference being, of course, that the sign of $V$ is changed. The current-voltage characteristic of an ideal p-n junction is therefore as shown in Fig. 2.25: in forward bias the current increases exponentially with voltage, while in reverse bias the current saturates at $J_0$ times the junction cross-sectional area.
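
The shape of the ideal characteristic follows directly from eq. (2.51). The sketch below evaluates it at a few bias voltages, using the saturation current density of Example 2.6 and $kT/e = 0.025$ V; these parameter choices and the function name are illustrative.

```python
import math


def diode_current_density(voltage, j_0, kt_over_e=0.025):
    """Eq. (2.51): J = J_0 [exp(eV/kT) - 1]; V is negative for reverse bias."""
    # expm1 keeps precision when eV/kT is small.
    return j_0 * math.expm1(voltage / kt_over_e)


J_0 = 1.6e-7   # saturation current density from Example 2.6, A m^-2

for bias in (-1.0, -0.1, 0.0, 0.1, 0.3, 0.5):
    print(f"V = {bias:+.1f} V: J = {diode_current_density(bias, J_0):+.3e} A m^-2")
```

In reverse bias the exponential term vanishes after a few multiples of $kT/e$, leaving $J = -J_0$, while in forward bias each additional $kT/e$ of voltage multiplies the current by a factor of about $e$.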

2.8.4 Junction geometry and depletion layer capacitance 

The two space charge layers at the junction vary in width and therefore in the amount of charge they contain as the bias voltage changes, thereby giving rise to an effective depletion layer (or junction) capacitance $C_j = dQ/dV$. This capacitance together with a charge storage capacitance (see below) is one of the factors which limits the high frequency operation of junction devices. On the other hand, the capacitance can be controlled by the bias voltage, and this effect is exploited in voltage-dependent capacitors such as the varactor diode. The charge density within the space charge layers results simply from the charge on the ionized impurities. Thus the charge density within the p region is

$$\rho_p = -N_ae \tag{2.52}$$

and similarly

$$\rho_n = N_de \tag{2.52a}$$

Referring to Fig. 2.26, we denote the widths of the p and n space charge regions by $x_p$ and $x_n$, with the origin of $x$ at the actual junction. Using Poisson's equation we may calculate the electric field distribution within the depletion layer. For the p side, from eq. (2.52), we have

$$\frac{d\mathcal{E}}{dx} = -\frac{eN_a}{\varepsilon_0\varepsilon_r} \tag{2.53}$$

where $\varepsilon_r$ is the relative permittivity of the semiconductor. Integrating eq. (2.53) with the appropriate limits, that is

$$\int_0^{\mathcal{E}_0}d\mathcal{E} = -\frac{eN_a}{\varepsilon_0\varepsilon_r}\int_{-x_p}^{0}dx$$

gives $\mathcal{E}_0$, the maximum value of the junction electric field. We find that

$$\mathcal{E}_0 = -\frac{eN_ax_p}{\varepsilon_0\varepsilon_r} \tag{2.54}$$

Similarly for the n side we have

$$\mathcal{E}_0 = -\frac{eN_dx_n}{\varepsilon_0\varepsilon_r} \tag{2.54a}$$

FIG. 2.26 Space charge density and variation of electric field $\mathcal{E}$ within the depletion region of a p-n junction. Note that as $N_d > N_a$ in this example, then $x_n < x_p$ but the space charge on each side of the junction has the same magnitude.


From eqs (2.54) we have

$$N_ax_p = N_dx_n \tag{2.55}$$

from which we see that the depletion layer extends farthest into the least heavily doped side of the junction as we have implied in, for example, Figs 2.20 and 2.26. This is a result of some significance in explaining the operation of many devices.

We can now relate the electric field to the junction contact potential $V_0$. Since in general

$$\mathcal{E}(x) = -\frac{dV(x)}{dx}$$

we may write

$$-\int_{V_p}^{V_n}dV(x) = -V_0 = \int_{-x_p}^{x_n}\mathcal{E}(x)\,dx$$

The right-hand integral is the area of the $\mathcal{E}(x)$ versus $x$ triangle shown in Fig. 2.26 so that

$$-V_0 = \tfrac{1}{2}\mathcal{E}_0W \tag{2.56}$$

where the width of the depletion layer $W$ is

$$W = |x_p| + |x_n| \tag{2.57}$$

Substituting for $\mathcal{E}_0$ from eq. (2.54a) into eq. (2.56) gives

$$V_0 = \frac{eN_dx_nW}{2\varepsilon_0\varepsilon_r} \tag{2.58}$$

while combining eqs (2.55) and (2.57) gives

$$x_n = \frac{WN_a}{N_d + N_a} \tag{2.59}$$

Then substituting $x_n$ from eq. (2.59) into eq. (2.58) yields

$$V_0 = \frac{e}{2\varepsilon_0\varepsilon_r}\left(\frac{N_aN_d}{N_a + N_d}\right)W^2$$

If a bias voltage is applied, the above argument still holds but we must replace $V_0$ by $(V_0 - V)$, where $V$ is positive for forward bias and negative for reverse bias. Therefore, finally, we have

$$W = \left[\frac{2\varepsilon_0\varepsilon_r(V_0 - V)}{e}\left(\frac{1}{N_a} + \frac{1}{N_d}\right)\right]^{1/2} \tag{2.60}$$



The capacitance associated with the depletion layer can be obtained as follows. The stored charge $Q_j$ on either side of the junction is given by

$$|Q_j| = AeN_dx_n = AeN_ax_p \tag{2.61}$$

where $A$ is the junction area. Hence using eqs (2.59) and (2.60) we may write

$$|Q_j| = \frac{AeN_dN_aW}{N_d + N_a} = A\left[2e\varepsilon_0\varepsilon_r(V_0 - V)\frac{N_dN_a}{N_d + N_a}\right]^{1/2} \tag{2.62}$$

If the bias voltage $V$ changes then $Q_j$ changes so that the junction capacitance $C_j$ is given by differentiating eq. (2.62) with respect to $V$. Hence

$$C_j = \left|\frac{dQ_j}{dV}\right| = \frac{A}{2}\left[\frac{2e\varepsilon_0\varepsilon_r}{(V_0 - V)}\,\frac{N_dN_a}{N_d + N_a}\right]^{1/2} \tag{2.63}$$

It is left as an exercise for the reader to show that $C_j = A\varepsilon_0\varepsilon_r/W$ farads, indicating that the abrupt junction behaves rather like a parallel plate capacitor of plate separation $W$ and area $A$ containing a medium of relative permittivity $\varepsilon_r$. We note, for reasonably large values of reverse bias $V$, that $C_j \propto V^{-1/2}$. In contrast, for the case of a graded junction it is found that $C_j \propto V^{-1/3}$.

In forward bias the junction capacitance $C_j$ is swamped by another capacitive effect which gives rise to the charge storage or diffusion capacitance $C_d$. In forward bias, many carriers are injected across the junction and it is the time taken for this injected carrier density to adjust to changes in the forward bias voltage which gives rise to the charge storage capacitance. The principal mechanism whereby the injected carrier density adjusts is recombination, and consequently the expression for $C_d$ includes the carrier lifetime $\tau$, that is $C_d = eI\tau/kT$ (see ref. 2.12).



EXAMPLE 2.7 Junction depletion layer capacitance 



We may calculate the capacitance of a silicon p$^+$-n junction at a reverse bias of 4 V, given that $N_d = 4 \times 10^{21}$ m$^{-3}$, $V_0 = 0.8$ V, $\varepsilon_r = 11.8$ and that the junction area is $4 \times 10^{-7}$ m$^2$.

In a p$^+$-n junction, $N_a \gg N_d$ and hence eq. (2.63) reduces to

$$C_j = \frac{A}{2}\left[\frac{2e\varepsilon_0\varepsilon_rN_d}{V_0 - V}\right]^{1/2}$$

With reverse bias $V = -4$ V we find that $C_j = 33.4$ pF.
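
Example 2.7 also provides a convenient check on the parallel plate result $C_j = A\varepsilon_0\varepsilon_r/W$. The sketch below evaluates the one-sided ($N_a \gg N_d$) forms of eqs (2.60) and (2.63) with the data of the example; the function names are illustrative.

```python
import math

E_CHARGE = 1.602e-19   # electronic charge, C
EPS_0 = 8.854e-12      # permittivity of free space, F/m


def depletion_width(n_d, eps_r, v_0, bias):
    """Eq. (2.60) in the limit N_a >> N_d; bias is negative for reverse bias."""
    return math.sqrt(2.0 * EPS_0 * eps_r * (v_0 - bias) / (E_CHARGE * n_d))


def junction_capacitance(area, n_d, eps_r, v_0, bias):
    """Eq. (2.63) in the limit N_a >> N_d, in farads."""
    return 0.5 * area * math.sqrt(2.0 * E_CHARGE * EPS_0 * eps_r * n_d / (v_0 - bias))


AREA, N_D, EPS_R, V_0, BIAS = 4e-7, 4e21, 11.8, 0.8, -4.0

width = depletion_width(N_D, EPS_R, V_0, BIAS)
c_j = junction_capacitance(AREA, N_D, EPS_R, V_0, BIAS)
c_plate = AREA * EPS_0 * EPS_R / width   # parallel plate form

print(f"W = {width:.3e} m, C_j = {c_j * 1e12:.1f} pF, plate form = {c_plate * 1e12:.1f} pF")
```

The two expressions for the capacitance agree, and increasing the reverse bias widens the depletion layer and so lowers $C_j$.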



2.8.5 Deviations from simple theory 

Although the theory described above and the diode characteristic shown in Fig. 2.25 are in 
reasonable agreement with what is observed in practice, there are several points of differ- 
ence. For our purposes one of the most important deviations is that, at sufficiently large values 
of reverse bias, breakdown occurs. That is, there is a sudden and rapid increase in reverse 
current at a particular value of reverse bias voltage (e.g. Fig. 2.27b). 

FIG. 2.27 Schematic representation of electron tunnelling in Zener breakdown of a heavily doped p-n junction under reverse bias (a) and the resulting effect on the i-V characteristic (b).

Reverse breakdown occurs by two mechanisms. The first, called the Zener effect, is due to quantum mechanical tunnelling. This takes place most readily in heavily doped junctions, which from eq. (2.60) leads to narrow depletion layers and therefore high junction fields. In effect, as we can see from Fig. 2.27(a), the energy bands on the two sides of the junction become 'crossed' so that filled states in the valence band of the p side are aligned with empty states in the conduction band of the n side. Electrons, therefore, tunnel from the p to the n side vastly increasing the reverse current.

The second mechanism, avalanche breakdown, occurs in lightly doped junctions with wide depletion layers. This mechanism involves impact ionization of the host atoms by energetic carriers. If carriers crossing the depletion layer acquire sufficient energy from the electric field between collisions, they may ionize lattice atoms on colliding with them. The electrons and holes so produced may, in turn, cause further ionizing collisions and so on to generate an avalanche of carriers. Neither breakdown mechanism is in itself destructive to the junction. If, however, the reverse current is allowed to become too large then Joule heating may cause damage to the device.

Other important deviations arise from our having ignored such factors as carrier genera- 
tion and recombination within the depletion layer. Carrier recombination leads to an increase 
in current in the forward direction as the carriers which recombine must be replaced from
the external circuit. In silicon and gallium arsenide diodes, this current mechanism may be
more important than current flow due to carrier injection, especially at low currents. This
and other factors lead in some cases to the p-n junction having a current-voltage relationship
of the form $J = J_0[\exp(eV/\beta kT) - 1]$, where $\beta$ varies between 1 and 2 depending on the
semiconductor and the temperature. Because $\beta$ determines the departure from the ideal diode
characteristic, it is often called the ideality factor.
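
As a rough numerical illustration of the role of the ideality factor, the short Python sketch below evaluates $J = J_0[\exp(eV/\beta kT) - 1]$ for $\beta = 1$ and $\beta = 2$. The function name `diode_current_density` and the value chosen for $J_0$ are assumptions made purely for illustration and are not taken from the text.

```python
import math

K_B = 1.380649e-23        # Boltzmann constant, J/K
E_CHARGE = 1.602177e-19   # electronic charge, C


def diode_current_density(v, j0, beta=1.0, temperature=300.0):
    """Return J = J0 [exp(eV / (beta k T)) - 1] for a bias v in volts."""
    # expm1(x) computes exp(x) - 1 accurately, including for small x.
    return j0 * math.expm1(E_CHARGE * v / (beta * K_B * temperature))


J0 = 1e-8  # A m^-2, illustrative saturation current density (assumed)
for beta in (1.0, 2.0):
    j = diode_current_density(0.3, J0, beta)
    print(f"beta = {beta}: J(0.3 V) = {j:.3e} A m^-2")
```

With $\beta = 2$ the forward current at a given bias equals that of an ideal diode at half the bias, which is why the slope of a plot of $\log J$ against $V$ is often used to estimate $\beta$.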

Carrier generation in the depletion layer gives rise to a larger value of reverse bias current 
than the simple theory predicts. Optical generation of carriers within the depletion layer may 
give rise to an increase in reverse current or initiate avalanche breakdown if the reverse bias 
is sufficiently great. These phenomena form the bases of the photodiodes discussed in 
Chapter 7. 



2.8.6 Other junctions 

In the discussion of p-n junctions in the preceding sections the basic material has been the 
same on both sides of the junction, that is we have been discussing the behaviour of semi- 
conductor homojunctions. In many devices, however, this is not the case and heterojunc- 
tions are used with significant advantage. Thus, for example, we have semiconductor 
heterojunctions and metal-semiconductor junctions. We shall briefly describe the basic
theory of both of these types of junction. It is interesting to note that in both cases the
predictions of the basic theory are not always borne out in practice; this is often caused by the
behaviour of defects close to the junction (ref. 2.13).



2.8.6.1 Semiconductor heterojunctions

Semiconductor heterojunctions are widely used in LEDs and particularly in semiconductor 
lasers (see section 5.10.2). Such junctions are formed by growing a particular semiconduc- 
tor crystal onto another different semiconductor crystal. This is relatively easy providing that 
the crystals both have the same crystal structure with comparable lattice parameters. At the
boundary between the two materials, however, there is a change in such material parameters
as the energy gap $E_g$, the work function $\phi$, the electron affinity $\chi$ and the relative
permittivity $\epsilon_r$ (and hence also the refractive index $n$).

FIG. 2.28 A p-N heterojunction: (a) shows the energy bands of the semiconductors separately; (b) shows
the energy bands after junction formation. (For a GaAs/Ga$_x$Al$_{1-x}$As heterojunction the energy gaps
are about 1.4 eV and 1.8 eV, $\Delta E_c$ = 0.32 eV and $\Delta E_v$ = 0.05 eV.)

At a heterojunction each of the semiconductors may be doped n-type or p-type so there 
are four possible combinations, namely n-N, p-P (isotype) and n-P, p-N (anisotype) junctions,
where the capital letter denotes the wider bandgap material. Several devices use heterojunctions
formed between gallium arsenide and gallium aluminium arsenide, Ga$_x$Al$_{1-x}$As,
and we shall base our discussion on these materials. It should be noted that the values of
the material parameters mentioned above for gallium aluminium arsenide depend on the
precise composition, that is on the value of $x$.

Let us first consider a p-N heterojunction. Figure 2.28(a) defines the characteristics of
the materials separately. Figure 2.28(b) illustrates the electron energy bands of the junction
in equilibrium, which at first glance may seem to be rather confusing. If it is recalled,
however, that mobile charges flow across the junction, leaving behind ionized impurities,
until equilibrium is reached (i.e. the Fermi level is constant across the junction), and that
the energy gap of a semiconductor remains constant within the semiconductor for constant
doping, then the situation becomes clearer. Electrons have flowed from regions of high to
low potential so that a depletion layer is formed in the conduction band of the N-type gallium
aluminium arsenide and the bands bend upwards. Correspondingly an electron accumulation
layer has formed in the p-type gallium arsenide so that the band bends downwards. An
energy spike of height $\Delta E_c$ is formed between these regions; the precise value of $\Delta E_c$ is
determined by the values of $E_g$ and $\chi$ for the two materials on either side of the junction. Similarly
a discontinuity of height $\Delta E_v$ appears in the valence bands at the junction. We would expect
such junctions to be rectifying, and indeed they are in practice. Similarly n-P junctions, as
illustrated in Fig. 2.29, are expected to be rectifying, and indeed are found to be so in practice.
The same is not always the case with n-N and p-P junctions based on these materials,
which frequently exhibit ohmic characteristics despite expectations to the contrary.
FIG. 2.29 An n-P heterojunction before (a) and after (b) junction formation.
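
The text states only that the band discontinuities are fixed by the values of $E_g$ and $\chi$ on either side of the junction. One common simplifying assumption, which is not derived here and which (as remarked in section 2.8.6) is not always borne out in practice, is the electron affinity rule: $\Delta E_c$ is taken as the difference between the electron affinities and $\Delta E_v$ as the remainder of the energy gap difference. The Python sketch below applies this rule; the function name `band_offsets` and all four input values are illustrative assumptions rather than data from the text.

```python
def band_offsets(eg_narrow, chi_narrow, eg_wide, chi_wide):
    """Return (dEc, dEv) in eV using the electron affinity rule (an assumption).

    dEc is the step in the conduction band edge and dEv the step in the
    valence band edge at the junction.
    """
    d_ec = chi_narrow - chi_wide             # conduction band discontinuity
    d_ev = (eg_wide - eg_narrow) - d_ec      # what is left of the gap difference
    return d_ec, d_ev


# Illustrative inputs in eV (assumed values, not taken from the text).
d_ec, d_ev = band_offsets(eg_narrow=1.43, chi_narrow=4.07,
                          eg_wide=1.80, chi_wide=3.75)
print(f"dEc = {d_ec:.2f} eV, dEv = {d_ev:.2f} eV")
```

By construction the two discontinuities always sum to the difference between the two energy gaps, whatever values of $\chi$ are supplied.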

The potential barriers created at heterojunctions can be significantly larger than in the case
of homojunctions; thus, for example, electrons injected into the narrow bandgap material 
(gallium arsenide) tend to accumulate there. Consequently, as we shall see in section 5.10.2, 
if an appropriate double heterojunction structure is formed carriers can be confined in the 
region between the two heterojunctions and thus form the active region of a semiconduc- 
tor laser. A further rather obvious, but useful, factor is that radiation created by band-to-band 
electron-hole recombinations in the wide bandgap material will not be absorbed by the 
narrow bandgap material. This is important in the fabrication of efficient surface- and edge- 
emitting LEDs. Further details of these and related benefits resulting from the use of het- 
erojunctions are described in ref. 2.13. 



2.8.6.2 Metal-semiconductor junctions

Metal-semiconductor junctions are attractive because of their ease of fabrication, and, as
we shall see in section 7.3.6.5, they are useful when operation with a fast response is required.
The behaviour of ideal metal-semiconductor junctions depends largely on the doping type
of the semiconductor and the relative size of the work functions of the two materials.

Let us first consider a metal to n-type semiconductor junction in which the work function
of the metal is greater than that of the semiconductor (i.e. $\phi_m > \phi_s$). The energy bands
of the separated materials are illustrated in Fig. 2.30(a), while the situation after the junc- 
tion is formed is shown in Fig. 2.30(b). As usual, when contact is established, charge flows 
across the junction until the Fermi levels are aligned in equilibrium. In this case electrons 
flow from the semiconductor into the metal thereby creating a depletion layer in the semi- 
conductor close to the junction. The accumulated negative charge in the metal is balanced 
by the positive charge on the uncompensated donor ions in the semiconductor. The result- 
ing junction electric field, contact potential difference and band bending are similar to the 
effects already discussed for p-n junctions in section 2.8.1. The depletion layer width, for
example, may be calculated from eq. (2.60) by treating the junction as though it were a p$^+$-n
junction.
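
To give a feel for the magnitudes involved, the sketch below estimates the depletion layer width of such a one-sided junction. It assumes that eq. (2.60) has the usual abrupt-junction form, which for a p$^+$-n junction ($N_a \gg N_d$) reduces to $W \approx [2\epsilon_0\epsilon_r(V_0 - V)/eN_d]^{1/2}$; the function name and the numerical inputs are hypothetical and chosen only for illustration.

```python
import math

EPS0 = 8.854e-12          # permittivity of free space, F/m
E_CHARGE = 1.602177e-19   # electronic charge, C


def depletion_width_one_sided(v_total, n_d, eps_r):
    """Depletion width (m) of a one-sided abrupt junction.

    v_total is the total potential across the junction, V0 - V, in volts
    (V is negative for reverse bias); n_d is the donor density in m^-3.
    The one-sided abrupt-junction formula is an assumption about eq. (2.60).
    """
    return math.sqrt(2.0 * EPS0 * eps_r * v_total / (E_CHARGE * n_d))


# Illustrative values (assumed): 0.5 V barrier, 1e22 m^-3 donors, eps_r = 12.
w = depletion_width_one_sided(0.5, 1e22, 12.0)
print(f"W = {w * 1e6:.2f} micrometres")
```

Because the width varies as the square root of the total junction potential, quadrupling that potential doubles the width.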

The equilibrium potential difference $V_0$, which prevents further net electron diffusion across
the junction from the n-type semiconductor to the metal, is given by $(\phi_m - \phi_s)$ as shown in
Fig. 2.30(b). The potential barrier height $V_B$ for electron injection from the metal into the
semiconductor is slightly different, and is given by $\phi_m - \chi_s$, where $\chi_s$ is the electron affinity
of the semiconductor. The potential barrier $V_0$ can be decreased or increased by the application
of an external forward or reverse bias voltage respectively. In either case $V_B$ remains
constant so that the junction behaves as a rectifying one as illustrated in Fig. 2.31. The junction
between a metal and p-type semiconductor, where $\phi_m < \phi_s$, is also rectifying: both of
these rectifying junctions are referred to as Schottky barrier diodes.

FIG. 2.30 A Schottky barrier formed by contacting a metal to an n-type semiconductor with the metal
having the larger work function: band diagrams (a) before and (b) after the junction is formed.
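
The two barrier heights just described can be evaluated in a couple of lines. The following Python sketch does so for an ideal metal to n-type semiconductor contact; the function name `schottky_barriers` and the numerical values are illustrative assumptions only (all quantities in eV), and real contacts often depart from these ideal values.

```python
def schottky_barriers(phi_m, phi_s, chi_s):
    """Ideal barrier heights (eV) for a metal to n-type semiconductor contact.

    Returns (v0, v_b): v0 opposes electron flow from semiconductor to metal,
    v_b opposes electron injection from metal to semiconductor.
    """
    v0 = phi_m - phi_s
    v_b = phi_m - chi_s
    return v0, v_b


# Illustrative values (assumed, not from the text).
v0, v_b = schottky_barriers(phi_m=4.8, phi_s=4.3, chi_s=4.05)
print(f"V0 = {v0:.2f} eV, VB = {v_b:.2f} eV")
```

The difference between the two results is $\phi_s - \chi_s$, that is the separation of the conduction band edge from the Fermi level in the bulk semiconductor, which is why the two barriers are only "slightly different".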

In both cases the forward current is due to the injection of majority carriers from the
semiconductor into the metal; this is in contrast to the forward current flow in p-n junctions.
The absence of minority carrier injection and the associated charge storage delay time (which 
is determined by the carrier recombination time) results in a very much reduced capacitance, 
which is important, for example, in the behaviour of fast Schottky barrier photodiodes with 
short response times. 
FIG. 2.31 The metal to n-type semiconductor junction shown in Fig. 2.30 under (a) forward bias and (b)
reverse bias.

In contrast let us consider the behaviour of metal to n-type semiconductor junctions in
which $\phi_m < \phi_s$. The Fermi levels now become aligned as a consequence of electron flow from
the metal into the semiconductor, thereby raising the semiconductor electron energies rel- 
ative to those in the metal in equilibrium. In this case there is no depletion layer in the semi- 
conductor, and there is only a small potential barrier opposing further electron flow from 
the metal to the semiconductor, which is easily overcome by the application of a small external
bias. Thus in effect we have an ohmic junction in which the current flow is essentially
independent of the direction of the bias voltage. The same is the case for junctions involving
p-type semiconductors in which $\phi_m > \phi_s$.
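
The four ideal cases discussed above (n-type or p-type semiconductor, with the metal work function larger or smaller than that of the semiconductor) can be summarized as a small lookup. The sketch below is only a restatement of the ideal theory; the function name `ideal_contact` is hypothetical, the treatment of exactly equal work functions as ohmic is our own choice, and, as the following paragraph explains, real contacts frequently behave differently because of surface states.

```python
def ideal_contact(doping, phi_m, phi_s):
    """Classify an ideal metal-semiconductor junction.

    doping is 'n' or 'p'; phi_m and phi_s are the metal and semiconductor
    work functions in the same units. Equal work functions are treated
    as ohmic (an assumption: no barrier is formed).
    """
    if doping == "n":
        return "rectifying" if phi_m > phi_s else "ohmic"
    if doping == "p":
        return "rectifying" if phi_m < phi_s else "ohmic"
    raise ValueError("doping must be 'n' or 'p'")


for doping in ("n", "p"):
    for phi_m in (4.0, 5.0):
        print(doping, phi_m, ideal_contact(doping, phi_m, phi_s=4.5))
```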

While it is imperative that we should be able to form good ohmic contacts between semi- 
conductor devices and external circuits, the ohmic contacts described above do not always 
behave as expected, and often display rectifying properties. This is largely because of the 
presence of surface states at the junction interface. Unlike semiconductor p-n junctions, 
which are formed within a single crystal, metal-semiconductor junctions include a termination
of the semiconductor crystal. The surface of the semiconductor invariably includes
surface states which can trap positive or negative charge at the surface, depending on the 
energies of the states. This trapped charge can cause band bending of the semiconductor 
energy bands resulting in the formation of a potential barrier (even in the absence of the metal) 
and in rectifying behaviour (ref. 2.14). 

One way of ensuring good ohmic contacts is to create a very heavily doped p$^+$ or n$^+$ layer
very close to the semiconductor surface. If a potential barrier exists at the surface the depletion
layer then will be thin enough so that carriers can easily tunnel through it. Such heavy
surface doping layers can be obtained by using appropriate metals which act as donors or
acceptors as the contact material, for example aluminium on p-type silicon. Alternatively
they can be created by including an additional diffusion or ion implantation stage in the
fabrication process.



2.9 The quantum well

There has been considerable interest recently in devices based on semiconductor structures 
in the form of a very thin slab, in which the thickness is less than about 10 nm, and very 
much less than the length and breadth of the slab. Quantum well structures may, for example,
be formed by sandwiching a region of gallium arsenide between two gallium aluminium 
arsenide regions; the gallium arsenide is then bounded by two heterojunctions. If electrons 
are confined within such a structure then the energy levels they occupy can be obtained from 
eq. (2.13). We suppose that the slab thickness is $L_z$, and that the other dimensions are $L_x$ and
$L_y$, so that

$$E(n_1, n_2, n_3) = \frac{h^2}{8m_e^*}\left(\frac{n_1^2}{L_x^2} + \frac{n_2^2}{L_y^2} + \frac{n_3^2}{L_z^2}\right) \qquad (2.64)$$

where $n_1$, $n_2$, $n_3$ are positive integers greater than zero. There is of course a corresponding
expression for holes in the valence band.
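
To see how strongly the thin dimension dominates eq. (2.64), the Python sketch below evaluates the electron energy levels for a slab 10 nm thick. The lateral dimensions of 1 µm, the effective mass of 0.068m and the function name `box_energy` are assumptions made for illustration.

```python
H = 6.626e-34     # Planck constant, J s
M0 = 9.109e-31    # free electron mass, kg
EV = 1.602e-19    # joules per electronvolt


def box_energy(n1, n2, n3, lx, ly, lz, m_eff):
    """Energy (J) of the level (n1, n2, n3) from eq. (2.64)."""
    if min(n1, n2, n3) < 1:
        raise ValueError("quantum numbers must be positive integers")
    return (H ** 2 / (8.0 * m_eff)) * (
        (n1 / lx) ** 2 + (n2 / ly) ** 2 + (n3 / lz) ** 2
    )


# Assumed geometry: 1 um x 1 um x 10 nm slab, electron effective mass 0.068 m.
LX = LY = 1e-6
LZ = 10e-9
M_E = 0.068 * M0

ground = box_energy(1, 1, 1, LX, LY, LZ, M_E)
in_plane_step = box_energy(2, 1, 1, LX, LY, LZ, M_E) - ground
subband_step = box_energy(1, 1, 2, LX, LY, LZ, M_E) - ground

print(f"in-plane level spacing : {in_plane_step / EV:.2e} eV")
print(f"sub-band spacing       : {subband_step / EV:.2e} eV")
```

The ratio of the two spacings is $(L_x/L_z)^2$, so with these dimensions the levels belonging to different values of $n_3$ are separated some ten thousand times more widely than neighbouring levels with the same $n_3$.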
FIG. 2.32 Energy levels within a quantum well structure of GaAs sandwiched between two GaAlAs
regions: (a) shows the discrete electron and hole energy levels for $n_3$ = 1, 2, 3, while (b) shows the
sub-bands corresponding to these values of $n_3$ (they are separated for clarity, but in reality they are
superimposed on one another).



Since $L_z$ is so small compared with $L_x$ and $L_y$, the energy levels will form groups, or sub-bands,
of closely spaced levels determined by the various integral values of $n_1$ and $n_2$, for
each value of $n_3$ as shown in Fig. 2.32. The density of states distribution in this case is no
longer the smooth parabolic curve depicted in Fig. 2.14(a), but rather it has the 'staircase'
structure shown in Fig. 2.33. There are a number of consequences of this situation. Firstly
we see that the lowest energy states, corresponding to $n_3 = 1$, are situated at an energy approximately
$h^2/8m_e^*L_z^2$ above the bottom of the conduction band, and correspondingly the uppermost
states in the valence band are about $h^2/8m_h^*L_z^2$ below the top of the band. Thus the
effective energy gap between the bands has increased by an amount

$$\Delta E_g = \frac{h^2}{8L_z^2}\left(\frac{1}{m_e^*} + \frac{1}{m_h^*}\right)$$

If we take $L_z$ = 10 nm, then as Example 2.8 shows, $\Delta E_g$ is of the order of 0.1 eV in GaAs.



EXAMPLE 2.8 Energy gap in a GaAs quantum well

We may calculate the effective increase in the width of the energy gap in a quantum
well structure in GaAs, with a thickness $L_z$ of 10 nm.
For GaAs

$$m_e^* = 0.068m; \quad m_h^* = 0.56m$$

Hence

$$\Delta E_g = \frac{(6.626 \times 10^{-34})^2}{8 \times (10 \times 10^{-9})^2 \times (9.1 \times 10^{-31})}\left(\frac{1}{0.068} + \frac{1}{0.56}\right) = 9.93 \times 10^{-21}\ \text{J or}\ 0.062\ \text{eV}$$
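
The arithmetic of Example 2.8 is easily checked numerically. The Python sketch below repeats the calculation; the function name `gap_increase` is our own, and the electron mass is taken as $9.109 \times 10^{-31}$ kg rather than the rounded value shown in the example, so agreement should be expected only to about three significant figures.

```python
H = 6.626e-34     # Planck constant, J s
M0 = 9.109e-31    # free electron mass, kg
EV = 1.602e-19    # joules per electronvolt


def gap_increase(lz, me_ratio, mh_ratio):
    """Increase in effective energy gap (J) for a well of thickness lz (m).

    me_ratio and mh_ratio are the electron and hole effective masses
    expressed as fractions of the free electron mass.
    """
    return H ** 2 / (8.0 * lz ** 2 * M0) * (1.0 / me_ratio + 1.0 / mh_ratio)


delta_j = gap_increase(10e-9, 0.068, 0.56)
delta_ev = delta_j / EV
print(f"delta E_g = {delta_j:.3e} J = {delta_ev:.3f} eV")
```

Since the result varies as $1/L_z^2$, halving the well thickness to 5 nm would increase the shift by a factor of four.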



Secondly we see that there are more states towards the bottom of the conduction band
than for the normal case (i.e. when $L_x$, $L_y$, $L_z$ are all comparable).

Because of the relative magnitudes of $L_x$, $L_y$, $L_z$ the lowest energy states will all have $n_3 = 1$,
and will effectively constitute a 'two-dimensional' group of states with varying $n_1$ and $n_2$
values. Now while the density of states function for a normal three-dimensional solid is given
by eq. (2.29) from which we see that $Z(E) \propto E^{1/2}$, the corresponding function for the
two-dimensional case is given by $Z(E) = 4\pi m_e^*/h^2$, which is independent of energy (see Problem
2.13). The distribution of electrons over these states at temperature $T$ is given by
$n(E) = Z(E)F(E)$, which, assuming that $E - E_F \gg kT$, becomes

$$n(E) \approx \frac{4\pi m_e^*}{h^2}\exp\left(-\frac{E - E_F}{kT}\right) \qquad (2.65)$$
FIG. 2.33 The 'stairwell' density of states for the quantum well structure (solid lines) compared with the
density of states for the 'normal' solid (dashed lines).




The halfwidth of this distribution is given by $kT \log_e 2$ or $0.69kT$, which is appreciably smaller
than the halfwidth of $1.8kT$ for the three-dimensional case.
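
Both halfwidths can be confirmed numerically. In units of $x = (E - E_{\text{edge}})/kT$, measured from the relevant band or sub-band edge, the two-dimensional distribution is proportional to $\exp(-x)$ and the three-dimensional one to $x^{1/2}\exp(-x)$. The Python sketch below, in which the helper names are our own and the halfwidth is taken to mean the full width at half the maximum value, locates the half-maximum points by bisection.

```python
import math


def bisect(f, lo, hi, tol=1e-12):
    """Return a root of f in [lo, hi], assuming f changes sign there."""
    f_lo = f(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def n3d(x):
    """Three-dimensional electron distribution, x = (E - Ec)/kT."""
    return math.sqrt(x) * math.exp(-x)


# The 3D distribution peaks at x = 1/2; find where it falls to half the peak.
peak = n3d(0.5)
x_low = bisect(lambda x: n3d(x) - 0.5 * peak, 0.0, 0.5)
x_high = bisect(lambda x: n3d(x) - 0.5 * peak, 0.5, 20.0)
width_3d = x_high - x_low

# The 2D distribution exp(-x) peaks at the sub-band edge (x = 0)
# and falls to half its maximum at x = ln 2.
width_2d = math.log(2.0)

print(f"2D halfwidth = {width_2d:.3f} kT, 3D halfwidth = {width_3d:.3f} kT")
```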

A third notable difference between the properties of the bulk semiconductor and the
two-dimensional quantum well structure involves the properties of excitons (section 2.4.3). In
theory, exciton spectral absorption lines for bulk semiconductors should be visible at an
energy just less than that of the bandgap wavelength $\lambda_g$ ($= hc/E_g$). In practice such absorption
lines are only seen in very pure materials, and then usually only at low temperatures.
Any impurities present in the semiconductor have the effect of providing screening of the
Coulombic interaction between the electrons and holes. In quantum well structures the electrons
and holes are much more closely confined so that their wavefunctions overlap more
strongly and transition probabilities are higher. In addition, the 'compression' in one dimension
has the effect of increasing the exciton binding energy to two or three times that given
by eq. (2.28a). Exciton transitions thus become much more pronounced and absorption lines
are readily observed even at room temperatures.

If an electric field is now applied along the $z$ direction, that is perpendicular to the plane
of the slab, the exciton electron and hole wavefunctions are considerably modified and the
exciton resonances both broaden and move to lower energies (ref. 2.15). This is called the
quantum confined Stark effect, and is the basis of a type of modulator to be discussed in the 
next chapter (section 3.9). It should be mentioned that bulk semiconductor materials show 
a change in absorption at wavelengths close to the bandgap wavelength when an electric field 
is applied, but the effect is very much smaller than in the quantum well structure. It is also 
noteworthy that, as materials properties and fabrication techniques improve, quantum wells 
are increasingly being used in active and passive optical and optoelectronic devices. These 
include LEDs and lasers (section 5.10.2), photodiodes and waveguides as well as optical
modulators (ref. 2.16).



PROBLEMS 



2.1 Explain the significance of each of the four quantum numbers used to describe atomic
energy states. Show how constraints on the values of these numbers lead to a system
of 'shells' and 'subshells' of energy states. Draw up a table of all the possible states
for $n = 4$.

2.2 Explain what is meant by the Pauli exclusion principle and show, on the basis of your
answers to Problem 2.1, what is meant by the electron configurations of the elements.
What are the expected electron configurations of diamond ($Z = 6$), Si ($Z = 14$), Ge
($Z = 32$), Sn ($Z = 50$) and Nd ($Z = 60$)? Why might the actual configurations be different
from those that you expect?

2.3 An electron in an electron microscope is accelerated by a voltage of 25 kV; what is its 
de Broglie wavelength? 

2.4 Show that the uncertainty principle can be expressed as $\Delta E\,\Delta t \geq h/2\pi$, where $\Delta E$ and $\Delta t$
are the uncertainties in energy and time respectively. What is the uncertainty in the
velocity of an electron confined in a cube of volume of 10 "' m ! ? 




2.5 Calculate the energies of the first three levels for a cubic potential well of side length
$10^{-10}$ m. How much energy is emitted if an electron falls from the third to the first
level? If the emitted energy is in the form of a photon, what is the frequency of the
associated wave?

2.6 According to the de Broglie relation we may regard a particle as having a wavelength
$\lambda$. Show that when an electron is moving along a one-dimensional array of ions with
separation $a$, then a standing wave will be set up when the electron wavevector ($2\pi/\lambda$)
is equal to $n\pi/a$ where $n$ is an integer.

From the standpoint of quantum theory, these standing waves represent electron 
wavefunctions. In fact two different wavefunctions can be set up corresponding to each 
n value since each ion may be either a node or an antinode for the standing wave. Show 
that these two will have different energies. (Hint: consider the Coulombic energy result- 
ing from the interaction of the electronic charge distributions with the ions.) How does 
this energy difference manifest itself? 

2.7 At 300 K the conductivity of intrinsic silicon is $5 \times 10^{-4}\ \Omega^{-1}$ m$^{-1}$. If the electron and
hole mobilities are 0.14 and 0.05 m$^2$ V$^{-1}$ s$^{-1}$ respectively what is the density of
electron-hole pairs?

If the crystal is doped with $10^{22}$ m$^{-3}$ phosphorus atoms, calculate the new conductivity;
repeat for the case of boron doping at the same impurity level. Assume all the
impurities are ionized in both cases.

2.8 Estimate the ionization energy of donors in GaAs given that $m_e^* = 0.07m$ and
$\epsilon_r = 10.9$.

2.9 Calculate the probabilities of finding electrons at energy levels of $E_F + 0.05$ eV and
$E_F - 0.05$ eV at $T$ = 0 K and 300 K.

2.10 The atomic mass number of copper is 63.54 and the density of copper is
$8.9 \times 10^3$ kg m$^{-3}$; confirm that its Fermi level is about 7.0 eV.

2.11 Calculate the intrinsic carrier concentration in GaAs at 290 K given that the electron
effective mass is 0.07m, the hole effective mass is 0.56m and that its energy gap is
1.43 eV.

2.12 Calculate the diffusion coefficients of electrons and holes in silicon at 290 K from the
data given in Problem 2.7. If it is assumed that the electron and hole lifetimes are both
50 µs, what are their diffusion lengths?

2.13 Derive eq. (2.29) for the density of states - use eq. (2.13) to set up a three-dimensional
integer space in which each point (i.e. each combination of $n_1$, $n_2$, $n_3$) represents
an energy level. Hence show, using eq. (2.11), that the number of energy states with
energy less than a reference energy $E_R$ (which equals the appropriate volume of integer
space) is $(4\pi/3)(8E_R mL^2/h^2)^{3/2}$. (Remembering that only positive integers $n_1$, $n_2$, $n_3$
have any physical meaning and that there are two spin states per energy level, we arrive
at eq. (2.29).) Show also that for two-dimensional integer space, the density of states
is independent of the energy.
2.14 Using eqs (2.29) and (2.30), derive eq. (2.33) for the electron density in the conduction
band of a semiconductor. Make, and if possible justify, the following assumptions:
(1) that the energy range of the conduction band can be taken as
$E_c < E < \infty$; and (2) that we can ignore the term +1 in the denominator of eq. (2.30) as
$E - E_F \gg kT$. Furthermore, make the substitution $x = (E - E_c)/kT$ and note that

$$\int_0^\infty x^{1/2}\exp(-x)\,dx = \sqrt{\pi}/2$$

2.15 An electron current of 10 mA is injected into a p-type silicon rod of 1 mm$^2$ cross-sectional
area. Assuming that the excess concentration decreases exponentially and that
at 5 mm from the contact the excess concentration has fallen to 40% of its value at
the contact, calculate the value at the contact. You may assume that
$D_e = 3.4 \times 10^{-3}$ m$^2$ s$^{-1}$.

2.16 Calculate the equilibrium contact potential for a step junction in silicon if $N_a$ = 10" m$^{-3}$
and $N_d = 10^{24}$ m$^{-3}$; take $\epsilon_r = 12$, $n_i = 1.5 \times 10^{16}$ m$^{-3}$ and $T$ = 290 K. Also calculate the
width of the junction both in equilibrium and with a reverse bias voltage of 50 V.

2.17 The resistivities of the p and n materials forming a p-n junction are 4.2 x 10" and
$2.08 \times 10^{-2}\ \Omega$ m respectively. If the hole and electron mobilities are 0.15 and
0.3 m$^2$ V$^{-1}$ s$^{-1}$, the hole and electron carrier lifetimes are 7? and 150 µs respectively
and the intrinsic carrier concentration is 2.5 x 10 1 " m$^{-3}$, calculate the saturation current
at 290 K given that the junction area is 10"" m$^2$. What fraction of the current is carried
by holes?

2.18 Calculate the maximum wavelengths of light which will give rise to photoeffects in
intrinsic GaAs ($E_g$ = 1.43 eV), CdS ($E_g$ = 2.4 eV) and InSb ($E_g$ = 0.225 eV).

2.19 Show that the average energy of the free electrons (which is equal to the total energy
divided by the electron density) in a solid at 0 K is given by $\langle E \rangle = \frac{3}{5}E_F$. (Hint: the total
energy of the free electrons can be calculated by multiplying the population of each
energy state by its energy and integrating over all states.)

2.20 Show that the most probable electron energy, and therefore the maximum electron
density, in the conduction band of a semiconductor is $\frac{1}{2}kT$ above the bottom of the
band. Also show that the average electron energy in the conduction band is $\frac{3}{2}kT$, while
the halfwidth of the distribution is approximately $1.8kT$.

2.21 Show that an electron of energy $E$ approaching a narrow potential barrier of height
$V$ ($V > E$) may penetrate the barrier (i.e. tunnel through it) providing the barrier is
not infinitely high. (Hint: if $V$ is not infinite then $\psi$ does not become zero at the barrier;
we may use the conditions that $\psi$ and $d\psi/dx$ are continuous at each side of the barrier.
Thus $\psi$ must have a value on the far side of the barrier, so that $\psi^*\psi$ (or $|\psi(x)|^2$) is finite,
implying that there is some probability of finding the electron beyond the barrier -
set up $\psi$ for each of the three regions: before the barrier, within the barrier and beyond
the barrier.)
Modulation of light



The advent of the laser (see Chapter 5) and the increasing use of lasers in a wide variety of
applications have led to a demand for devices which can modulate a beam of light.
Applications of light modulators include wideband analog optical communication systems,
switching for digital information recording, information storage and processing, pulse
shaping, beam deflection and scanning, and frequency stabilization and Q-switching of lasers.
Some of these applications are discussed in Chapter 6. In this chapter, we have interpreted
the term modulation rather broadly so that we also include sections on scanning and some
aspects of laser wavelength tuning. Several of the materials, for example KDP, which are
useful in conventional modulators exhibit non-linear effects and consequently may be used 
for harmonic generation and parametric oscillation. These techniques, together with those 
described in section 6.5.1.6, enhance the available range of laser wavelengths. 

A modulator is a device which changes the irradiance (or direction) of the light passing 
through it. There are several general types of modulator: namely, mechanical choppers and 
shutters, passive (or dye) modulators, electro-optic, magneto-optic and elasto-optic (acousto- 
optic) modulators. The first two types will be covered briefly in section 6.4 in which laser 
Q-switching is discussed. In the remaining types the refractive index and other optical characteristics
of a medium are changed by the application of a force field, that is electrical, magnetic
or mechanical (acoustical). In these cases, apart from the acousto-optical effect where
the variation of refractive index creates a 'diffraction grating', the applied force field modifies
the polarizing properties of the medium. This in turn may be used to modify the phase
or irradiance of a beam of light propagating through the medium. Accordingly this chapter
begins with a brief review of optical polarization, birefringence and optical activity in nat- 
urally occurring crystals. 



3.1 Elliptical polarization

We have already described (section 1.2.1) plane polarized light in which all of the wave trains
comprising a beam of light have their electric vectors lying in the same plane. In many cases
of interest a beam of light may consist of two plane polarized wave trains with their planes
of polarization at right angles to each other, and which may also be out of phase.

Let us consider initially the special case where the amplitudes of the two wave trains are
equal and the phase difference is $\pi/2$. In this case, if the wave trains are propagating in the
$z$ direction, we may write the component electric fields as

$$\boldsymbol{\mathcal{E}}_x = \mathbf{i}\,\mathcal{E}_0\cos(kz - \omega t) \quad\text{and}\quad \boldsymbol{\mathcal{E}}_y = \mathbf{j}\,\mathcal{E}_0\sin(kz - \omega t) \qquad (3.1)$$

where $\omega$ is the angular frequency, $k$ ($= 2\pi/\lambda$) is the wavenumber or propagation constant and
$\mathbf{i}$ and $\mathbf{j}$ are unit vectors in the $x$ and $y$ directions respectively. The total electric field is the
vector sum of the two components, that is

$$\boldsymbol{\mathcal{E}} = \boldsymbol{\mathcal{E}}_x + \boldsymbol{\mathcal{E}}_y$$

or

$$\boldsymbol{\mathcal{E}} = \mathcal{E}_0[\mathbf{i}\cos(kz - \omega t) + \mathbf{j}\sin(kz - \omega t)] \qquad (3.2)$$

The resultant expressed by eq. (3.2) can be interpreted as a single wave in which the electric
vector at a given point in space is constant in amplitude but rotates with angular frequency
$\omega$. Waves such as this are said to be circularly polarized. Figure 3.1 shows the electric
field vector (a) at a given instant of time and (b) at a given point in space.

The signs of the terms in eq. (3.2) are such that the electric vector at a given point in space
has a clockwise rotation when viewed against the direction of propagation. A wave with such
an electric vector is said to be right circularly polarized.

If the sign of the second term is changed (this is equivalent to a change of $\pi$ in the phase
of $\mathcal{E}_y$) then the sense of rotation is counterclockwise and the wave is said to be left circularly
polarized.
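
These statements about eq. (3.2) can be verified with a brief numerical experiment. The Python sketch below samples the field at a fixed point in space, checks that its magnitude stays constant, and determines the sense of rotation from the sign of the $z$ component of the cross product of the field at two successive instants. The function names, the unit amplitude and the choice $k = \omega = 2\pi$ are arbitrary assumptions made for illustration.

```python
import math


def field_xy(z, t, k, omega, e0=1.0, sign=+1):
    """Components (E_x, E_y) of eq. (3.2); sign=-1 reverses the second term."""
    phase = k * z - omega * t
    return e0 * math.cos(phase), sign * e0 * math.sin(phase)


def rotation_sense(sign, k=2 * math.pi, omega=2 * math.pi, z=0.0, dt=1e-3):
    """z component of E(t) x E(t + dt) at a fixed position z.

    A negative value means clockwise rotation for an observer looking back
    towards the source (against the propagation direction), with x drawn to
    the right and y upwards; a positive value means counterclockwise.
    """
    ex1, ey1 = field_xy(z, 0.0, k, omega, sign=sign)
    ex2, ey2 = field_xy(z, dt, k, omega, sign=sign)
    return ex1 * ey2 - ey1 * ex2


# Magnitude of the field at z = 0 sampled over one period.
amplitudes = [
    math.hypot(*field_xy(0.0, n / 50.0, 2 * math.pi, 2 * math.pi))
    for n in range(50)
]

print("amplitude range:", min(amplitudes), max(amplitudes))
print("eq. (3.2) as written :", "clockwise" if rotation_sense(+1) < 0 else "counterclockwise")
print("second term reversed :", "clockwise" if rotation_sense(-1) < 0 else "counterclockwise")
```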

When the amplitudes of the electric vectors of the two waves are not the same but the
phase difference remains at $\pi/2$ then the resultant electric vector at any point in space rotates
at frequency $\omega$ but changes in magnitude. The end of the electric vector describes an ellipse
as illustrated in Fig. 3.2.

If the component waves can be represented by $\boldsymbol{\mathcal{E}}_x = \mathbf{i}\,\mathcal{E}_0\cos(kz - \omega t)$ and
$\boldsymbol{\mathcal{E}}_y = \mathbf{j}\,\mathcal{E}_0'\sin(kz - \omega t)$ with $\mathcal{E}_0 \neq \mathcal{E}_0'$, then the major and minor axes of the ellipse are parallel
to the $x$ and $y$ axes. In general the electric vector amplitudes are not equal and also there is
an arbitrary phase difference $\phi$ between the component waves. In this case the end of the
resultant electric vector again describes an ellipse but with the major and minor axes inclined
at an angle of $\frac{1}{2}\tan^{-1}[(2\mathcal{E}_0\mathcal{E}_0'\cos\phi)/(\mathcal{E}_0^2 - \mathcal{E}_0'^2)]$ to the $x$ and $y$ axes (see Problem 3.1). In all
of these instances, the resultant wave is said to be elliptically polarized and in fact plane and
circular polarization are special cases of elliptical polarization. For the purposes of this book
it is not instructive to dwell on a full discussion of elliptically polarized light (ref. 3.1) but
rather to describe it simply in terms of its components parallel to and perpendicular to a
convenient axis or plane.

FIG. 3.1 Right circularly polarized light: (a) electric vectors at a given instant in time and (b) rotation of the
vector at a given position in space. (Note that in this, and the following diagrams, for the sake of clarity only
the electric vectors are shown.)

FIG. 3.2 Right elliptically polarized light; electric vectors at (a) a given instant in time and (b) a given
position in space.
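
The expression for the inclination of the ellipse can be checked numerically without working through Problem 3.1. In the Python sketch below the two components are written as $a\cos\tau$ and $b\cos(\tau - \phi)$, the tip of the electric vector is traced over one period, and the direction in which the vector is longest is compared with the formula. The function names and sample values are our own assumptions; the use of `math.atan2` to select the branch of the inverse tangent that corresponds to the major axis is an implementation choice and not part of the text.

```python
import math


def tilt_from_formula(a, b, phi):
    """Inclination (rad) of the major axis to the x axis.

    Implements (1/2) arctan[2ab cos(phi) / (a^2 - b^2)], with atan2 used
    to pick the branch belonging to the major axis.
    """
    return 0.5 * math.atan2(2.0 * a * b * math.cos(phi), a * a - b * b)


def tilt_from_sampling(a, b, phi, steps=20000):
    """Trace the ellipse and return the direction of the longest vector."""
    best_r2, best_angle = -1.0, 0.0
    for i in range(steps):
        tau = 2.0 * math.pi * i / steps
        ex = a * math.cos(tau)
        ey = b * math.cos(tau - phi)
        r2 = ex * ex + ey * ey
        if r2 > best_r2:
            best_r2, best_angle = r2, math.atan2(ey, ex)
    return best_angle


def axis_difference(angle1, angle2):
    """Smallest angle between two axis directions (defined modulo pi)."""
    d = (angle1 - angle2) % math.pi
    return min(d, math.pi - d)


a, b, phi = 2.0, 1.0, math.pi / 3
print("formula :", math.degrees(tilt_from_formula(a, b, phi)), "degrees")
print("sampled :", math.degrees(tilt_from_sampling(a, b, phi)) % 180.0, "degrees")
```

When $\phi = \pi/2$ the formula gives zero inclination, recovering the special case in which the axes of the ellipse lie along the $x$ and $y$ axes.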



3.2 Birefringence

Ordinary glass is isotropic in its properties, but many important crystalline optical materials
such as calcite (CaCO$_3$), quartz (SiO$_2$) and KDP (potassium dihydrogen phosphate,
KH$_2$PO$_4$) are anisotropic. This anisotropy is due to the arrangement of the atoms being different
in different directions through the crystal. Thus, for example, the electric polarization
$P$ produced in the crystal by a given electric field $\mathcal{E}$ is not a simple scalar multiple of
the field but in fact varies in a manner that depends on the direction of the applied field in 
relation to the crystal lattice. One of the consequences of this is that the refractive index, 
experienced by electromagnetic waves travelling through the crystal, depends on not only 
the direction of propagation of the waves, but also the direction of the polarization of the 
waves. Consequently the phase velocity of the waves also depends on these directions. Such 
crystals are said to be birefringent or doubly refracting; the names refer to the fact that in 
general there are two different directions of propagation through the crystal that a given inci- 
dent ray may take depending on the direction of its polarization. The rays corresponding to 
these directions travel with different velocities and have mutually orthogonal planes of polarization.
Consequently, when unpolarized light or light of arbitrary polarization relative to
the crystal structure is incident on a doubly refracting crystal, the light propagating through
the crystal may be considered to comprise two independent waves which travel with different
velocities. Similarly if light propagates in a given direction through the crystal there
are only two possible values of phase velocity, which correspond to two orthogonal states
of polarization.

FIG. 3.3 Double refraction by a birefringent crystal.

The theory, which is not very straightforward, shows that in general crystals exhibit three 
different principal refractive indices and two optic axes (see Appendix 2 and ref. 3.2 for 
further discussion). The optic axes are directions in the crystal along which the velocities 
of the two orthogonally polarized waves are the same. In many important crystals, for 
example calcite, two of the principal indices are the same and there is only one optic axis. 
Such crystals are called uniaxial whereas other doubly refracting crystals, for example mica, 
are biaxial. In cubic crystals which are isotropic the principal indices are all the same. 

A simple way of observing birefringence is to allow a narrow beam of unpolarized light
to fall normally onto a parallel-sided calcite plate as shown in Fig. 3.3. The beam is found
to divide into two parts. One, the so-called ordinary or O-ray (represented in diagrams by
a row of dots), passes straight through the crystal, as might be expected, while the other, the
so-called extraordinary or E-ray (represented in diagrams by a row of short bars), diverges from
the O-ray as it passes through the crystal and then emerges parallel to it. This is found to be
the case unless the direction of incidence of the original beam is parallel or perpendicular to
the optic axis. The ordinary and extraordinary rays are found to have orthogonal directions of
polarization, normal and parallel respectively to the principal section of the crystal (shown in
Fig. 3.3), that is, a plane which contains the optic axis and is normal to a pair of opposite
parallel surfaces of the crystal.

The relationship between the different values of refractive index and crystal structure
can be shown on the refractive index ellipsoid or optical indicatrix, which is illustrated
in Fig. 3.4 for uniaxial crystals, which display two refractive indices. As implied above,
these are the ordinary refractive index $n_o$, which does not vary with direction, and the
extraordinary refractive index $n_e$ (or more precisely $n_e(\theta)$) whose value depends on the
direction of propagation $\theta$ relative to the optic axis. Although the index ellipsoid and its
applications are described in more detail in Appendix 2, we can illustrate its use by
considering light propagating along the direction r shown in Fig. 3.4, where for convenience
(but without loss of generality) the projection of r on the xy plane is along the y axis. The
plane normal to r which passes through the origin intersects the index ellipsoid along the
perimeter of the shaded ellipse shown. The two directions of polarization mentioned above
then lie along the principal axes (OP and OQ) of this ellipse. In addition the corresponding
refractive indices are given by the magnitudes of OP and OQ. One of these, OP, is
independent of $\theta$ and represents the refractive index $n_o$ corresponding to the ordinary ray,
while the other, OQ, depends on the angle $\theta$ and gives the refractive index $n_e(\theta)$
corresponding to the extraordinary ray.

FIG. 3.4 The refractive index ellipsoid for a uniaxial crystal. Waves with polarizations parallel
to the x and y axes experience a refractive index $n_o$, while those polarized parallel to the z
axis experience a refractive index $n_e$. For wave propagation in a general direction r there are
two allowed directions of polarization, i.e. parallel to OP with index $n_o$ and parallel to OQ
with index $n_e(\theta)$.
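The length of OQ can be evaluated numerically. The sketch below assumes the standard index-ellipsoid result for a uniaxial crystal, $1/n_e^2(\theta) = \cos^2\theta/n_o^2 + \sin^2\theta/n_e^2$, which is not written out in the passage above; the function name and the calcite-like index values are illustrative assumptions only.

```python
import math

def extraordinary_index(theta_rad, n_o, n_e):
    """Index seen by the extraordinary wave propagating at angle theta to the optic axis."""
    inv_sq = (math.cos(theta_rad) / n_o) ** 2 + (math.sin(theta_rad) / n_e) ** 2
    return 1.0 / math.sqrt(inv_sq)

# Illustrative (assumed) values, roughly those of calcite in visible light.
N_O, N_E = 1.658, 1.486

for deg in (0, 30, 60, 90):
    n = extraordinary_index(math.radians(deg), N_O, N_E)
    print(f"theta = {deg:2d} deg  n_e(theta) = {n:.4f}")
```

Along the optic axis ($\theta = 0$) the function returns $n_o$, so the two waves travel at the same velocity, while at $\theta = 90°$ it returns the principal value $n_e$.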

It may help us to visualize the behaviour of doubly refracting materials if we were to draw
surfaces in which the distance of each point from the origin corresponds to the values of
$n_e(\theta)$ and $n_o$ for wave propagation in the direction $\theta$. The resulting surfaces
would be a sphere for the ordinary rays and an ellipsoid for the extraordinary ones. Such normal
(index) surfaces for a positive crystal are shown in Fig. 3.5. We can now envisage light spreading
out from a point source located at the origin using Huygens' construction. The ordinary rays
will form a spherical surface, while the extraordinary ones form an ellipsoidal one, which
in the case of positive uniaxial crystals will lie inside the sphere as velocity is inversely
proportional to refractive index. The sphere and ellipsoid will of course coincide as they cross
the optic axis, and be separated by the greatest distance, and therefore phase difference, in
the direction perpendicular to the optic axis. We may use this construction to help us to
envisage the behaviour of phase plates described in the next section.

FIG. 3.5 Normal (index) surfaces for a positive uniaxial crystal projected on the yz plane.



3.2.1 Phase plates 

Plates of crystals can be cut with particular orientations to the optic axis to produce a desired
effect in an optical beam propagating through the plate. For example, plates cut with their
large area surfaces parallel to the optic axis can be fabricated to introduce a given phase
change between the O- and E-rays as shown in Fig. 3.6. In particular a plate of thickness
d such that the optical path difference $|n_o d - n_e d| = \lambda_0/4$, which of course is
equivalent to a phase change of $\pi/2$, is called a quarter-wave plate. For quartz, for example,
d should be equal to 0.0164 mm for sodium light (Problem 3.2). When plane polarized light is
incident on a quarter-wave plate, the emergent light is in general elliptically polarized. If the
plane of polarization of a plane polarized incident beam, however, is inclined at 45° to the optic
axis then the amplitudes of the E and O components will be the same, and the emergent light
is circularly polarized as explained in section 3.1. In a similar way, half-wave and whole-wave
plates can be fabricated. Such plates are often used in light modulation systems.

FIG. 3.6 Double refraction by a negative crystal which is cut such that the optic axis is parallel
to the crystal surface but normal to the plane of incidence. In this case there is no divergence
of the E- and O-rays, but they become increasingly out of phase as they propagate through the
crystal.
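The quarter-wave thickness quoted for quartz can be checked with a few lines of code. This is only a sketch: the quartz indices $n_o = 1.544$ and $n_e = 1.553$ and the sodium wavelength of 589.3 nm are assumed values that are not given in the passage above, and the function name is our own.

```python
def wave_plate_thickness(wavelength_m, n_o, n_e, fraction=0.25):
    """Thinnest plate giving an optical path difference of `fraction` wavelengths."""
    return fraction * wavelength_m / abs(n_o - n_e)

# Assumed indices for quartz in sodium light (589.3 nm); not quoted in the text.
d = wave_plate_thickness(589.3e-9, n_o=1.544, n_e=1.553)
print(f"quarter-wave thickness = {d * 1e3:.4f} mm")
```

With these assumed indices the result is about 0.0164 mm, in agreement with the figure given for quartz. Setting `fraction=0.5` gives the corresponding half-wave plate.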



3.3 Optical activity

Certain crystals (and liquids) have the ability to rotate the plane of polarization of light
passing through them; that is, they are optically active. Thus, for example, when a beam of plane
polarized light is incident normally on a crystal plate of quartz cut perpendicular to the optic
axis, it is found that the emergent beam is also plane polarized but that its electric vector
vibrates in a different plane from that of the incident light. The plane of vibration may be
rotated in a clockwise sense looking against the oncoming light by right-handed or dextrorotatory
crystals, or in a counterclockwise sense by left-handed or laevorotatory crystals.
Quartz exists in both forms. It is found that the rotation depends on the thickness of the crystal
plate and the wavelength. The rotation produced by a quartz plate 1 mm thick for sodium
light is 21.7° while it is 3.67° for 1 mm of sodium chlorate.

Optical activity can be explained by assuming that in optically active crystals the velocity
of propagation of circularly polarized light is different for different directions of rotation,
that is the crystal has refractive indices $n_r$ and $n_l$ for right and left circularly polarized
light. It is easy to show (see Problem 3.3) that a plane polarized wave can be resolved into
two circularly polarized waves with opposite directions of rotation. If these travel through
the crystal at different speeds, a phase difference will be introduced between them which grows
with distance through the crystal. This corresponds to a rotation of the plane of the plane
polarized wave which results from the recombination of the two circularly polarized waves.
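The size of the index difference implied by the measured rotation can be estimated as follows. The sketch assumes the standard relation between rotation and circular birefringence, rotation $= \pi d\,|n_l - n_r|/\lambda_0$ (in radians), which is not derived in the text, and takes sodium light to have a wavelength of 589.3 nm; the function names are illustrative.

```python
import math

def rotation_deg(thickness_m, wavelength_m, delta_n_circ):
    """Rotation of the plane of polarization for circular birefringence |n_l - n_r|."""
    return math.degrees(math.pi * thickness_m * delta_n_circ / wavelength_m)

def circular_birefringence(rotation_deg_per_m, wavelength_m):
    """Invert the relation: |n_l - n_r| from the rotation per unit length."""
    return math.radians(rotation_deg_per_m) * wavelength_m / math.pi

# Quartz, sodium light: 21.7 degrees per mm (figure quoted in the text).
dn = circular_birefringence(21.7e3, 589.3e-9)
print(f"|n_l - n_r| = {dn:.2e}")
```

Under these assumptions the two circular indices of quartz differ by only about $7 \times 10^{-5}$, which shows how small a difference in velocity is needed to produce an easily measured rotation.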



3.4 Electro-optic effect

When an electric field is applied across an optical medium the distribution of electrons within
it is distorted so that the polarizability and hence the refractive index of the medium changes
anisotropically. The result of this electro-optic effect may be to introduce new optic axes
into naturally doubly refracting crystals, for example KDP, or to make naturally isotropic
crystals, for example gallium arsenide, doubly refracting.

The change in refractive index as a function of the applied field can be obtained from an
equation of the form (see ref. 3.3 and Appendix 2)

    $\Delta(1/n^2) = r\mathcal{E} + P\mathcal{E}^2$    (3.3)

where r is the linear electro-optic coefficient and P is the quadratic electro-optic coefficient.
In solids, the linear variation in the refractive index associated with $r\mathcal{E}$ is known as
the Pockels effect while the variation arising from the quadratic term is called the Kerr effect
(not to be confused with the magneto-optic effect also named after Kerr).

In the case of the Pockels effect, the precise effects of the applied electric field depend
on the crystal structure and symmetry of the material under consideration. With KDP, for
example, if the electric field is applied along the z direction then the x and y principal axes
are rotated through 45° into new principal axes x' and y' and the refractive indices in these
new directions become (see Appendix 2)

    $n_{x'} = n_o + \frac{n_o^3}{2} r_{63}\mathcal{E}_z$
    $n_{y'} = n_o - \frac{n_o^3}{2} r_{63}\mathcal{E}_z$    (3.4)

Thus, writing $\Delta n = |n_{x'} - n_o|$,

    $|\Delta(1/n^2)| = \frac{2\Delta n}{n_o^3} = r_{63}\mathcal{E}_z$    (3.5)

which is in agreement with eq. (3.3) assuming that the Kerr constant P is very small, where
$r_{63}$ is the appropriate electro-optic coefficient for KDP. In the interests of notational
convenience we shall drop the subscripts from r, though it should be remembered that the precise
coefficient used depends on the crystal symmetry and directions of the field and light wave.
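The index change predicted by the linear term is easily evaluated. The following sketch assumes a uniform field equal to the applied voltage divided by the crystal dimension along the field, and uses the KD*P figures that appear in Example 3.1 (r = 26.4 pm V$^{-1}$, $n_o$ = 1.51, 4000 V across 10 mm); the function name is our own.

```python
def pockels_index_change(r_m_per_v, n_o, voltage_v, thickness_m):
    """|n - n_o| = (1/2) r n_o^3 E, with the field E taken as V / thickness."""
    field = voltage_v / thickness_m
    return 0.5 * r_m_per_v * n_o ** 3 * field

# KD*P values used in Example 3.1: r = 26.4 pm/V, n_o = 1.51.
dn = pockels_index_change(26.4e-12, 1.51, voltage_v=4000.0, thickness_m=10e-3)
print(f"|n - n_o| = {dn:.2e}")
```

The result, about $1.8 \times 10^{-5}$, shows that even a field of several hundred kilovolts per metre changes the index only in the fifth decimal place.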

EXAMPLE 3.1 Change in refractive index due to the Pockels effect

We may calculate the change in refractive index for a 10 mm wide crystal of deuterium-substituted
KDP (i.e. KD*P) for an applied voltage of 4000 V.
From eq. (3.5) and using the data in Table 3.1 below we have

    $n - n_o = \pm\frac{1}{2} \times 26.4 \times 10^{-12} \times (1.51)^3 \times 4000/10^{-2}$

or

    $|n - n_o| = 1.8 \times 10^{-5}$



Let us consider a beam of plane polarized light propagating in the z direction through a
crystal such as KDP with its plane of polarization at 45° to the induced axes x' and y' as shown
in Fig. 3.7. If the incident beam is given by $\mathcal{E} = \mathcal{E}_0 \cos(\omega t - kz)$,
then the components along the x' and y' directions will be

    $\mathcal{E}_{x'} = \frac{\mathcal{E}_0}{\sqrt{2}} \cos(\omega t - kz)$
    $\mathcal{E}_{y'} = \frac{\mathcal{E}_0}{\sqrt{2}} \cos(\omega t - kz)$    (3.6)

In view of the fact that these components experience the refractive indices given by eq. (3.4),
they will become increasingly out of phase as they propagate through the crystal. Thus if
the crystal is of thickness L the phase changes of the two components will be

    $\phi_{x'} = \frac{2\pi}{\lambda_0} n_{x'} L$  and  $\phi_{y'} = \frac{2\pi}{\lambda_0} n_{y'} L$

FIG. 3.7 A beam of plane polarized light incident on an electro-optic crystal plate subject to a
voltage V will be resolved into components with polarizations along x' and y', the induced
principal directions. The induced birefringence occurs in a plane normal to the applied electric
field.

Using eqs (3.4) which relate the refractive index to the electric field, we see that

    $\phi_{x'} = \frac{2\pi}{\lambda_0} L n_o \left(1 + \frac{1}{2} r n_o^2 \mathcal{E}_z\right)$
    $\phi_{y'} = \frac{2\pi}{\lambda_0} L n_o \left(1 - \frac{1}{2} r n_o^2 \mathcal{E}_z\right)$    (3.7)

which may be written

    $\phi_{x'} = \phi_0 + \Delta\phi$
    $\phi_{y'} = \phi_0 - \Delta\phi$    (3.7a)

where

    $\phi_0 = \frac{2\pi}{\lambda_0} L n_o$  and  $\Delta\phi = \frac{\pi}{\lambda_0} r n_o^3 L \mathcal{E}_z = \frac{\pi}{\lambda_0} r n_o^3 V$    (3.8)

In eq. (3.8) we have taken $\mathcal{E}_z$ to equal V/L, where V is the applied voltage.
The net phase shift, or total retardation, between the two waves resulting from the
application of the voltage V is seen to be

    $\Phi = \phi_{x'} - \phi_{y'} = 2\Delta\phi = \frac{2\pi}{\lambda_0} r n_o^3 V$    (3.9)

and the emergent light will in general be elliptically polarized.

From eqs (3.6) and (3.7), the components of the wave emerging from the electro-optic
crystal can now be written as (omitting common phase factors)

    $\mathcal{E}_{x'} = \frac{\mathcal{E}_0}{\sqrt{2}} \cos(\omega t + \Delta\phi)$    (3.10)

and

    $\mathcal{E}_{y'} = \frac{\mathcal{E}_0}{\sqrt{2}} \cos(\omega t - \Delta\phi)$    (3.10a)



The phase shift $\Delta\phi$ for each component depends directly on the applied voltage V (eq. 3.8)
so that we can vary the phase shift by varying the voltage applied to a given crystal.
Suppose that we now insert, after the electro-optic crystal, a plane polarizing element orientated
at right angles to the polarizing element producing the original plane polarized beam, as shown
in Fig. 3.8. Then, as we can see from Fig. 3.8, the transmitted electric field components will be
$-\mathcal{E}_{x'}/\sqrt{2}$ and $\mathcal{E}_{y'}/\sqrt{2}$. That is, using eqs (3.10) we can
write the transmitted electric field (to within an overall sign, which does not affect the
irradiance) as

    $\mathcal{E} = \frac{\mathcal{E}_0}{2}[\cos(\omega t + \Delta\phi) - \cos(\omega t - \Delta\phi)]$

or

    $\mathcal{E} = -\mathcal{E}_0 \sin\Delta\phi \sin\omega t$

FIG. 3.8 Arrangement of the components of a Pockels electro-optic modulator in which an
electro-optic crystal is placed between crossed polarizers. The state of polarization at various
positions within the device is also indicated; the components transmitted by the horizontal
polarizer are $-\mathcal{E}_{x'}/\sqrt{2}$ and $\mathcal{E}_{y'}/\sqrt{2}$.

Thus the irradiance of the transmitted beam, which is given by averaging $\mathcal{E}^2$ over a
complete period $T = 2\pi/\omega$, can be written as

    $I = \frac{\omega}{2\pi}\int_0^{2\pi/\omega} \mathcal{E}^2\,dt$

or

    $I = I_0 \sin^2\Delta\phi = I_0 \sin^2(\Phi/2)$    (3.11)

where $I_0$ is the irradiance of the light incident on the electro-optic crystal. As the phase
retardation in the Pockels effect is proportional to the voltage, we can see from eqs (3.11) and
(3.9) that the transmittance as a function of applied voltage is given by

    $\frac{I}{I_0} = \sin^2\left(\frac{\pi}{\lambda_0} r n_o^3 V\right)$

which we may write as

    $\frac{I}{I_0} = \sin^2\left(\frac{\pi}{2}\frac{V}{V_\pi}\right)$    (3.12)



where $V_\pi$ ($= \lambda_0/(2 r n_o^3)$) is the voltage required for maximum transmission, that
is $I = I_0$. $V_\pi$ is often called the half-wave voltage since it causes the two waves
polarized parallel to the principal axes to acquire a relative spatial displacement of
$\lambda_0/2$, which is equivalent to a phase difference of $\pi$. Thus a beam of plane polarized
light incident on the modulator would have its plane of polarization rotated by 90° when a voltage
$V_\pi$ is applied to the modulator (see Example 3.2). The value of $V_\pi$ depends on the
electro-optic material and the wavelength (see Problem 3.4 and Table 3.1).

TABLE 3.1 Characteristics of some electro-optic materials used in Pockels cells

Material                           Linear electro-optic            $n_o$†    $\epsilon_r$‡
                                   coefficient, r (pm V$^{-1}$)
KH$_2$PO$_4$ (KDP)                 10.6
KD$_2$PO$_4$ (KD*P)                26.4
NH$_4$H$_2$PO$_4$ (ADP)            8.5
Cadmium telluride (CdTe)           6.8
Lithium tantalate (LiTaO$_3$)      30.3
Lithium niobate (LiNbO$_3$)        30.8
Gallium arsenide (GaAs)            1.6
Zinc sulfide (ZnS)                 2.1
Barium borate (BaB$_2$O$_4$)

† Values near a wavelength of 550 nm.
‡ Low frequency values.
Note: Several of the materials listed have more than one linear electro-optic coefficient; we have
quoted the one which is relevant for use in Pockels cell modulators. (See ref. 5.5b for further
details.)

Thus we see that the transmittance of the system shown in Fig. 3.8 can be altered by the
application of a voltage along the direction of propagation as illustrated in Fig. 3.9. Such
systems are referred to as Pockels electro-optic modulators.
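The half-wave voltage and the transmission curve of eq. (3.12) can be sketched numerically as below. The KDP coefficient is taken from Table 3.1; the index $n_o$ = 1.51 and the wavelength of 1.06 µm are those used in Example 3.2, and the function names are illustrative rather than part of any library.

```python
import math

def half_wave_voltage(wavelength_m, r_m_per_v, n_o):
    """V_pi = lambda_0 / (2 r n_o^3) for the longitudinal Pockels modulator."""
    return wavelength_m / (2.0 * r_m_per_v * n_o ** 3)

def transmission(voltage_v, v_pi):
    """I / I_0 = sin^2((pi / 2) V / V_pi), eq. (3.12), for crossed polarizers."""
    return math.sin(0.5 * math.pi * voltage_v / v_pi) ** 2

# KDP: r = 10.6 pm/V (Table 3.1); n_o = 1.51 and 1.06 um as in Example 3.2.
v_pi = half_wave_voltage(1.06e-6, 10.6e-12, 1.51)
print(f"V_pi = {v_pi / 1e3:.1f} kV")
for frac in (0.0, 0.25, 0.5, 1.0):
    print(f"V = {frac:.2f} V_pi  ->  I/I0 = {transmission(frac * v_pi, v_pi):.3f}")
```

For these values the half-wave voltage is about 14.5 kV. The transmission rises from zero at V = 0, through one half at $V_\pi/2$, to unity at $V_\pi$.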

It is obvious that the modulation is not linear; indeed from eq. (3.12) for small voltages
V, the transmitted irradiance is proportional to $V^2$. The effectiveness and ease of operation
of a Pockels modulator can be enhanced by including a quarter-wave plate in the beam
between the initial polarizer and the modulator as shown in Fig. 3.10(a). This introduces a
phase difference of $\pi/2$ between the two polarized components before they enter the
voltage-sensitive modulator. A bias is therefore introduced into the transmission curve so that the
transmission is varied about the point Q, as illustrated in Fig. 3.10(b), rather than about zero.
The change in transmission in the vicinity of Q is more nearly linear with voltage than at
the origin.

EXAMPLE 3.2 Half-wave voltage

We may calculate the half-wave voltage for KDP, for example, at a wavelength of
1.06 µm, using Table 3.1. We have

    $V_\pi = \frac{\lambda_0}{2 r n_o^3} = \frac{1.06 \times 10^{-6}}{2 \times 10.6 \times 10^{-12} \times (1.51)^3}$

    $V_\pi = 14.5$ kV

FIG. 3.9 Transmission curve for the system shown in Fig. 3.8 as a function of the applied voltage.

FIG. 3.10 Arrangement of the components of a Pockels electro-optic cell biased with a quarter-wave
plate (a) and the resulting transmission as a function of applied voltage (b). The bias results in
a 50% irradiance transmission; in the vicinity of this point the variation of transmission with
applied voltage is almost linear.

With a quarter-wave plate bias, we see that the phase difference between the components is

    $\Phi = \frac{\pi}{2} + 2\Delta\phi = \frac{\pi}{2} + \pi\frac{V}{V_\pi}$

and therefore from eq. (3.12)

    $\frac{I}{I_0} = \sin^2\left(\frac{\pi}{4} + \frac{\pi}{2}\frac{V}{V_\pi}\right) = \frac{1}{2}\left[1 + \sin\left(\pi\frac{V}{V_\pi}\right)\right]$

For small values of applied voltage V (up to about 5% of $V_\pi$), $\sin(\pi V/V_\pi) \approx \pi V/V_\pi$
and the change in irradiance is therefore nearly linear with V. If, therefore, a small sinusoidally
varying voltage of amplitude $V_0$ and frequency f is applied to the modulator, then the
irradiance of the transmitted beam will also vary at frequency f, as illustrated in Fig. 3.10(b).
That is, we may write

    $\frac{I}{I_0} = 0.5 + \frac{\pi}{2}\frac{V_0}{V_\pi}\sin 2\pi f t$    (3.13)

where $(V_0/V_\pi)\sin 2\pi f t = V/V_\pi$ should be very much less than unity, otherwise the
irradiance variation will be distorted and contain an appreciable amount of higher order harmonics.
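The quality of the linear approximation can be examined by comparing the exact biased response with its small-signal form. The sketch below uses only the two expressions given above, expressed in terms of the ratio $V/V_\pi$; the function names are our own.

```python
import math

def biased_transmission(v_over_vpi):
    """Exact I / I_0 = (1/2) [1 + sin(pi V / V_pi)] with quarter-wave bias."""
    return 0.5 * (1.0 + math.sin(math.pi * v_over_vpi))

def linearized_transmission(v_over_vpi):
    """Small-signal form I / I_0 = 0.5 + (pi / 2) V / V_pi, as in eq. (3.13)."""
    return 0.5 + 0.5 * math.pi * v_over_vpi

for ratio in (0.01, 0.05, 0.2):
    exact = biased_transmission(ratio)
    approx = linearized_transmission(ratio)
    print(f"V/V_pi = {ratio:.2f}  exact = {exact:.4f}  linear = {approx:.4f}  "
          f"error = {approx - exact:.1e}")
```

At 5% of $V_\pi$ the two forms differ by only a few parts in ten thousand of $I_0$, whereas at 20% of $V_\pi$ the discrepancy is of the order of 2% of $I_0$, which indicates where distortion begins to matter.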

The modulator described above is called a longitudinal effect device as the electric field
is applied in the direction of propagation of the beam. This can be done by using electrodes
with small apertures in them on either side of the electro-optic crystal (Fig. 3.11(a)), or by
evaporating semitransparent conducting films onto the crystal surfaces. Both of these techniques
suffer from obvious disadvantages. To avoid these, an electro-optic modulator with a cylindrical
crystal and ring electrode geometry has been developed. This device, which is shown
in Fig. 3.11(b), results in very uniform transmission (or polarization) across the effective
aperture of the device.

FIG. 3.11 Longitudinal electro-optic cells: (a) shows a cell with end electrodes of relatively
small aperture; (b) shows the ring electrode geometry with larger aperture. Typical cell
dimensions are length 25 mm, radius 6 mm and electrode width 8 mm.

Alternatively, we can use the transverse mode of operation in which the field is applied
normal to the direction of propagation. In this case the field electrodes do not interfere with
the beam and the retardation (or phase difference), which is proportional to the electric field
multiplied by the crystal length, can be increased by the use of longer crystals. (In the
longitudinal effect the retardation is independent of crystal length.) Suppose that as before the
applied field is in the z direction while the direction of propagation is along the y'-induced
principal axis, as shown in Fig. 3.12. Then, if the incident light is polarized in the x'z plane
at 45° to the x' principal axis, the retardation, using eq. (A2.15) and $n_z = n_e$ from
Appendix 2, is

    $\Phi = \phi_{x'} - \phi_z = \frac{2\pi}{\lambda_0} L \left[(n_o - n_e) + \frac{n_o^3}{2} r \frac{V}{D}\right]$    (3.14)

FIG. 3.12 Transverse electro-optic modulator. The electric field is applied normal to the direction
of propagation.

where L is the length of the crystal, D is the crystal dimension in the direction of the applied
voltage V and $n_o$, $n_e$ are the refractive indices for light polarized parallel to the principal
directions. The voltage-independent term will bias the irradiance transmission curve. The
half-wave voltage may be reduced by having a long, thin cell. Therefore the frequency response
of transverse cells is better than in longitudinal cells as it is easier to change small voltages.
However, transverse modulators suffer from having very small apertures.

In many practical situations the modulation signal is at very high frequencies and may
occupy a large bandwidth so that the wide frequency spectrum available with laser sources
may be fully utilized. The capacitance of the modulator and finite transit time of the light
through it give rise to limitations in the bandwidth and maximum modulation frequency.
Let C be the capacitance due to the electro-optic crystal and its electrodes and $R_s$ be the
internal resistance of the modulating source. Then if $R_s$ is greater than $(2\pi f_0 C)^{-1}$,
where $f_0$ is the average modulation frequency, most of the modulation potential drop will be
across $R_s$ and therefore will be wasted as it will not contribute to the electro-optic
retardation. This problem can be overcome by connecting the crystal in a resonant circuit as shown
in Fig. 3.13. The value of the inductance $\mathcal{L}$ is such that
$4\pi^2 f_0^2 = 1/(\mathcal{L}C)$ so that at resonance ($f = f_0$) the impedance of the circuit is
simply $R_L$, which is chosen to be greater than $R_s$, and hence most of the modulation voltage
appears across the crystal. The resonant circuit has a finite bandwidth, that is its impedance is
high only over the frequency range $\Delta f = (2\pi R_L C)^{-1}$ (centred on $f_0$).
Therefore the maximum modulation bandwidth must be less than $\Delta f$ for the modulated signal
to be a faithful representation of the applied modulating voltage.

FIG. 3.13 Electro-optic crystal represented by a parallel plate capacitor C connected in a
resonant circuit.
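The two circuit relations just stated can be turned into a short design calculation. The component values below are hypothetical and chosen only to illustrate orders of magnitude; the function names are our own.

```python
import math

def resonant_inductance(f0_hz, capacitance_f):
    """Inductance satisfying 4 pi^2 f0^2 = 1 / (L C)."""
    return 1.0 / (4.0 * math.pi ** 2 * f0_hz ** 2 * capacitance_f)

def circuit_bandwidth(load_resistance_ohm, capacitance_f):
    """Bandwidth delta_f = 1 / (2 pi R_L C) of the parallel resonant circuit."""
    return 1.0 / (2.0 * math.pi * load_resistance_ohm * capacitance_f)

# Hypothetical component values chosen only for illustration.
C = 10e-12      # crystal plus electrode capacitance, farads
F0 = 100e6      # centre modulation frequency, hertz
R_L = 1e3       # load resistance, ohms

print(f"inductance = {resonant_inductance(F0, C) * 1e9:.1f} nH")
print(f"bandwidth  = {circuit_bandwidth(R_L, C) / 1e6:.1f} MHz")
```

The calculation makes the trade-off explicit: a larger $R_L$ puts more of the source voltage across the crystal but narrows the bandwidth in inverse proportion.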

In practice the bandwidth $\Delta f$ is governed by the specific application, though bandwidths
in the region of $10^8$-$10^9$ Hz are readily obtained. In addition, if a peak phase difference or
retardation is required we can evaluate the power we need to apply to the crystal. The peak
retardation $\Phi_m = (2\pi/\lambda_0) r n_o^3 V_m$ (eq. 3.9) corresponds to a peak modulating
voltage $V_m = (\mathcal{E}_z)_m L$. The power $P = V_m^2/2R_L$ needed to obtain the peak
retardation is therefore related to the modulation bandwidth by

    $P = \frac{\Phi_m^2 \lambda_0^2 C \Delta f}{4\pi r^2 n_o^6}$

or

    $P = \frac{\Phi_m^2 \lambda_0^2 A \epsilon_r \epsilon_0 \Delta f}{4\pi r^2 n_o^6 L}$    (3.15)

We have taken the capacitance of the crystal at the modulation frequency $f_0$ to be
$C = A\epsilon_r\epsilon_0/L$, where A is the cross-sectional area of the crystal normal to the
direction of propagation, in which the crystal length is L.
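Eq. (3.15) is straightforward to evaluate once the cell geometry is fixed. The sketch below uses the KD*P cell of Example 3.3 (25 mm diameter aperture, 30 mm length, 633 nm, peak retardation $\pi/30$) with $\epsilon_r$ = 50 and $n_o$ = 1.51 as used in the worked examples, and assumes a bandwidth of $10^9$ Hz; the function name is our own.

```python
import math

EPSILON_0 = 8.85e-12  # permittivity of free space, F/m

def modulation_power(phi_m, wavelength_m, area_m2, eps_r, bandwidth_hz,
                     r_m_per_v, n_o, length_m):
    """Eq. (3.15): P = phi_m^2 lambda_0^2 A eps_r eps_0 df / (4 pi r^2 n_o^6 L)."""
    numerator = (phi_m ** 2 * wavelength_m ** 2 * area_m2
                 * eps_r * EPSILON_0 * bandwidth_hz)
    denominator = 4.0 * math.pi * r_m_per_v ** 2 * n_o ** 6 * length_m
    return numerator / denominator

# KD*P cell: 25 mm diameter aperture, 30 mm long, 633 nm, peak retardation pi/30.
# eps_r = 50 and n_o = 1.51 are the values used in the worked examples.
area = math.pi * (12.5e-3) ** 2
p = modulation_power(math.pi / 30, 633e-9, area, 50.0, 1e9, 26.4e-12, 1.51, 30e-3)
print(f"P = {p:.0f} W")
```

With these inputs the power is a little over 300 W. Because P is proportional to $\Delta f$, the same cell driven over a bandwidth of $10^8$ Hz would need one tenth of this power.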



EXAMPLE 3.3 Power requirement for modulation using a Pockels cell

We may estimate the power required to give a phase retardation of $\pi/30$ at a frequency
bandwidth of $10^9$ Hz using a KD*P Pockels cell with a circular aperture of 25 mm
diameter and 30 mm length at a wavelength of 633 nm.
From eq. (3.15) and using Table 3.1 we find that

    $P = \frac{\pi^2 \times (633 \times 10^{-9})^2 \times \pi \times (12.5 \times 10^{-3})^2 \times 50 \times 8.85 \times 10^{-12} \times 10^9}{30^2 \times 4 \times \pi \times (26.4 \times 10^{-12})^2 \times (1.51)^6 \times 30 \times 10^{-3}}$

      $= 306.4$ W

The maximum modulation frequency $f_m$ should be such that the electric field applied to
the crystal does not change substantially in a time equal to the transit time $t_t$ of the light
through the crystal; that is,

    $t_t = Ln/c \ll 1/f_m$

Typically L = 10 mm and hence $f_m$ must be substantially less than $2 \times 10^{10}$ Hz in KDP
where the refractive index $n \approx 1.5$.
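The transit-time limit is a one-line calculation. In the sketch below the crystal length of 10 mm is an assumed illustrative value, the speed of light is rounded to $3 \times 10^8$ m s$^{-1}$, and the function name is our own.

```python
C_LIGHT = 3.0e8  # speed of light in vacuum, m/s

def transit_time(length_m, n):
    """Time for light to cross a crystal of the given length and refractive index."""
    return length_m * n / C_LIGHT

# Illustrative crystal: 10 mm long, n = 1.5 (KDP-like).
t_t = transit_time(10e-3, 1.5)
print(f"transit time = {t_t:.1e} s, so f_m must be well below {1.0 / t_t:.1e} Hz")
```

For a 10 mm crystal with n = 1.5 the transit time is $5 \times 10^{-11}$ s, so the modulation frequency must be kept well below $2 \times 10^{10}$ Hz. Since the limit scales inversely with L, shorter crystals allow proportionally higher frequencies.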

To overcome this restriction the modulating signal can be applied transversely in the form
of a wave travelling along the electrodes with a velocity equal to the phase velocity of the
optical signal propagating through the modulating crystal. The optical wave then experiences
a constant refractive index as it passes through the modulator and much higher modulation
frequencies are possible. Although in principle it is quite easy to arrange synchronization
of the electrical and optical waves, in practice it is more difficult because of the limitations
of the optical materials available. Ideally we need a material for which $n = \sqrt{\epsilon_r}$,
where $\epsilon_r$ is the relative permittivity (or dielectric constant) of the medium. In most
materials this is not the case ($n < \sqrt{\epsilon_r}$) and the desired synchronization must be
obtained by reducing $\epsilon_r$ by including air gaps in the electrical waveguide cross-section.
Alternatively, a travelling wave modulator may be realized by, in effect, slowing down the optical
wave. This may be achieved, for example, by letting it propagate through the modulator along the
zigzag path shown in Fig. 3.14.

3.4.1 Materials 

Any transparent crystal lacking a centre of symmetry exhibits a first-order electro-optic effect.
To be useful, however, such crystals must have a sizeable electro-optic coefficient r and be
available in reasonably sized, good quality crystals at modest cost. Some of the properties
of technologically useful materials are listed in Table 3.1 above.

Potassium dihydrogen phosphate and ammonium dihydrogen phosphate crystals (KDP
and ADP) are available in large sizes at relatively low cost but are very hygroscopic and
fragile. They also have rather large half-wave voltages $V_\pi$; however, if deuterium is
substituted for hydrogen (i.e. KD*P) the electro-optic properties are greatly enhanced. Other
materials such as lithium tantalate and lithium niobate (LiTaO$_3$ and LiNbO$_3$) have much
smaller half-wave voltages, but reasonably sized crystals are rather expensive. Cadmium
telluride and gallium arsenide are used in the infrared in the spectral ranges 1-28 µm and
1-14 µm respectively.

FIG. 3.14 Travelling wave modulator. The light beam follows the zigzag path shown so that in effect
it propagates along the modulator at the same speed as the modulating signal and thereby
experiences a constant modulating field.



3.5 Kerr modulators

Many isotropic media, both solids and liquids, when placed in an electric field behave as
uniaxial crystals with the optic axis parallel to the electric field. In this optoelectric effect,
which was discovered in glass by J. Kerr in 1875, the change in refractive index is proportional
to the square of the applied field as we indicated in section 3.4. The difference in refractive
indices for light polarized parallel to and perpendicular to the induced optic axis is given
by

    $\Delta n = n_p - n_s = K\lambda_0\mathcal{E}^2$    (3.16)

where K is the Kerr constant (see Table 3.2).
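As a rough numerical illustration of eq. (3.16), the induced birefringence in nitrobenzene can be computed as below. The Kerr constant is taken as $244 \times 10^{-14}$ m V$^{-2}$ at 589.3 nm, as listed in Table 3.2, while the field of $10^6$ V m$^{-1}$ is an arbitrary assumed value; the function name is our own.

```python
def kerr_birefringence(kerr_constant, wavelength_m, field_v_per_m):
    """Eq. (3.16): delta_n = K * lambda_0 * E^2."""
    return kerr_constant * wavelength_m * field_v_per_m ** 2

# Nitrobenzene at 589.3 nm: K = 244e-14 m/V^2 (Table 3.2).
# The field of 1e6 V/m is an arbitrary illustrative value.
dn = kerr_birefringence(244e-14, 589.3e-9, 1.0e6)
print(f"delta_n = {dn:.2e}")
```

Because the effect is quadratic, doubling the field quadruples the induced index difference, in contrast to the linear dependence of the Pockels effect.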

The electric field induces an electric moment in non-polar molecules and changes the
moment of polar molecules. There is then a reorientation of the molecules by the field which
causes the medium as a whole to become anisotropic. This explains the delay that occurs
between the application of the field and the appearance of the maximum effect. The delay
can be several seconds, but for non-polar liquids the delay is very small and probably less
than $10^{-11}$ s. A Kerr cell filled with one of these liquids and placed between crossed polaroids
can be used as an optical switch or modulator instead of a Pockels cell. Modulation at
frequencies up to $10^{10}$ Hz has been obtained.

Although liquid Kerr cells containing nitrobenzene have been used extensively for many
years, for example in the accurate measurement of the velocity of light, they suffer from the
disadvantage of requiring a large power to operate them. A more promising approach is to
use mixed ferroelectric crystals operating at a temperature near to the Curie point where a
greatly enhanced optoelectric effect is observed. (Ferroelectric materials exhibit a spontaneous
electric polarization, similar to the spontaneous magnetization of ferromagnetic materials,
below a certain temperature. This is the Curie point, above which the crystal structure
changes and the ferroelectricity disappears.) Potassium tantalate niobate (KTN), which is
used in Kerr effect devices, is a mixture of two crystals with high and low Curie points giving
a Curie point for the mixture near to room temperature. The crystal has to be 'poled' by the
application of a large bias voltage. This has the effect of causing the ferroelectric domains
with electric polarization in the direction of the applied field to grow at the expense of the
other domains until the whole crystal is polarized in one direction. This then reduces the a.c.
voltage required for 100% modulation to about 50 V peak and the half-wave voltage is very
much less than for other materials, being about 250 V in KTN and barium titanate (BaTiO$_3$).
Nevertheless, most practical electro-optic modulators make use of the Pockels effect.

TABLE 3.2 Typical values of the Kerr constant K for $\lambda_0$ = 589.3 nm at about 20°C

Material             K ($\times 10^{-14}$ m V$^{-2}$)
Water                5.2
Nitrobenzene         244
Nitrotoluene         137
Glasses (various)    0.03-0.17

3.5.1 Optical frequency Kerr effect 

In the optical frequency Kerr effect, as the name implies, the refractive index change is
brought about by an applied optical frequency field. This effect offers the interesting
possibility of one beam of light being used to switch another if the refractive index change can
be used to eliminate the second beam. Such optical beam switching can be achieved using
a Fabry-Perot interferometer with an electro-optic material as the spacer between the reflecting
plates. Intense maxima in the interference pattern formed by light passing through the
interferometer occur when $p\pi = (2\pi/\lambda_0) n d \cos\theta$, where p is the order of
interference, d is the plate separation and $\theta$ the angle between an internally reflected ray
and the surface normal. A small change in the refractive index n of the spacer material induced by
an optical beam would detune the interferometer and switch off the transmitted beam.



3.6 Scanning and switching

We saw in section 3.4 that the application of a voltage $V_\pi$ to a Pockels cell will in effect
rotate the plane of polarization of the transmitted optical beam through 90°. Thus if a block of
birefringent material is placed after the cell the beam can be switched from one position to
another as shown in Fig. 3.15. An array of m such combinations in sequence can be used
to address $2^m$ different locations. Such a system may be used, for example, in bit-oriented
optical memories (ref. 6.4d, Chapter 21).

FIG. 3.15 Beam switching using a Pockels cell modulator. As the applied voltage is changed from
zero to $V_\pi$ the beam is switched from position 1 to position 2.

FIG. 3.16 Schematic diagram of a beam deflector: (a) the double prism KDP beam deflector and (b) the
principle of beam deflection in a medium where the refractive index varies linearly in a direction
normal to the direction of propagation.

Alternatively the arrangement shown in Fig. 3.16(a) can be used for beam switching. Here
we have two similar prisms of KDP, for example, but with opposite orientations. Thus, if
an electric field is applied in the z direction and the optical beam travels in the direction of
one of the induced principal axes with its polarization parallel to the other principal axis,
then the beam will 'see' different refractive indices in the two prisms. The difference in
refractive indices will be $n_o^3 r_{63} \mathcal{E}_z$; that is, from eqs (3.4), we see that a ray entirely in the upper prism
travels in a medium of refractive index

$n_1 = n_o - \frac{n_o^3}{2} r_{63} \mathcal{E}_z$

while a ray entirely in the lower prism travels in a medium where the effect of the applied
field is reversed so that

$n_2 = n_o + \frac{n_o^3}{2} r_{63} \mathcal{E}_z$

We can see that this arrangement leads to a deflection of the beam by considering Fig. 3.16(b).
Here we show a crystal whose refractive index and hence optical path length varies with
the transverse distance $x$ across the crystal. If we assume that the variation of $n$ with $x$ is
uniform then ray 1 'sees' a refractive index $n$ while ray 2 'sees' an index $n + \Delta n$. The rays
1 and 2 will traverse the crystal in times $t_1$ and $t_2$ where

$t_1 = \frac{Ln}{c}$ and $t_2 = \frac{L(n + \Delta n)}{c}$

The difference in transit times results in ray 2 lagging behind ray 1 by a distance
$\Delta y = (c/n)(t_2 - t_1) = L\Delta n/n$; this is equivalent to a deflection of the wavefront by an angle $\theta'$ measured inside
the crystal just before the beam emerges. We can see from Fig. 3.16(b) that $\theta' = \Delta y/W$, where
$W$ is the width of the crystal.

Using Snell's law, the angle of beam deflection $\theta$ measured outside the crystal is given
by (assuming $\theta$ is small)

$\theta = n\theta' = n\Delta y/W$

or

$\theta = L\Delta n/W$

Thus, using the arrangement shown in Fig. 3.16(a), the deflection $\theta$ is given by

$\theta = (L/W) n_o^3 r_{63} \mathcal{E}_z$    (3.17)
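
To give a feel for the size of the deflection in eq. (3.17), the following sketch evaluates it for one set of numbers. The crystal dimensions, the field and the KDP constants ($n_o \approx 1.51$, $r_{63} \approx 10.6 \times 10^{-12}$ m V$^{-1}$) are assumed, illustrative values rather than data from the text, and the function name is hypothetical.

```python
def deflection_angle(length, width, n_o, r63, field):
    """External deflection angle (rad) of the double-prism deflector, eq. (3.17)."""
    return (length / width) * n_o ** 3 * r63 * field


# Assumed, illustrative values (not taken from the text)
N_O = 1.51        # ordinary refractive index of KDP
R63 = 10.6e-12    # electro-optic coefficient, m/V
theta = deflection_angle(length=0.02, width=0.005, n_o=N_O, r63=R63, field=1.0e6)
print(f"deflection = {theta * 1e3:.3f} mrad")
```

With these numbers the deflection is a fraction of a milliradian, which indicates why long, narrow crystals and high fields are needed for useful deflections.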

Light beams can also be deflected by means of diffraction gratings which are electro-opti- 
cally induced in a crystal by evaporating a periodic metallic grating electrode onto the crystal. 
A voltage applied to this electrode induces a periodic variation in the refractive index thereby 
creating an efficient phase diffraction grating. This technique is especially useful in the inte- 
grated optical devices discussed in Chapter 9. Beam deflectors and scanners are used in laser 
displays, printers and scribers, for optical data storage systems and in optical character recog- 
nition. 



3.7 Magneto-optic devices

The presence of magnetic fields may also affect the optical properties of some substances 
thereby giving rise to a number of useful devices. In general, however, as electric fields are 
easier to generate than magnetic fields, electro-optic devices are usually preferred to magneto- 
optic devices. 

3.7.1 Faraday effect 

This is the simplest magneto-optic effect and the only one of real interest for optical
modulators; it concerns the change in refractive index of a material subjected to a steady
magnetic field. Faraday found in 1845 that when a beam of plane polarized light passes through
a substance subjected to a magnetic field, its plane of polarization is observed to rotate by
an amount proportional to the magnetic field component parallel to the direction of
propagation. This is very similar to optical activity which, as we saw in section 3.3, results from
certain materials having different refractive indices $n_r$ and $n_l$ for right and left circularly
polarized light. There is one important difference between the two effects. In the Faraday effect the sense
of rotation of the plane of polarization is independent of the direction of propagation. This
is in contrast to optical activity where the sense of rotation is related to the direction of
propagation. Thus in the case under discussion the rotation can be doubled by reflecting the light
back through the Faraday effect device.

TABLE 3.3 Typical values of the Verdet constant $V$ for $\lambda$ = 589.3 nm

Material                  $V$ (rad m$^{-1}$ T$^{-1}$)
Quartz (SiO$_2$)          4.0
Zinc sulfide (ZnS)        82
Crown glass               6.4
Flint glass               23
Sodium chloride (NaCl)    9.6

The rotation of the plane of polarization is given by

$\theta = VBL$    (3.18)

where $V$ is the Verdet constant (see Table 3.3 for some representative values), $B$ is the
magnetic flux density parallel to the direction of propagation and $L$ is the path length in the
material. The Faraday effect is small and wavelength dependent; the rotation for dense flint glass
is $\theta$ = 1.6° mm$^{-1}$ T$^{-1}$ at $\lambda_0$ = 589.3 nm.
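
Equation (3.18) is easily evaluated for the materials of Table 3.3. The sketch below does so; the flux density of 0.5 T and the 50 mm path length are assumed, illustrative values, and the function names are hypothetical.

```python
import math

# Verdet constants in rad m^-1 T^-1 at 589.3 nm, from Table 3.3
VERDET = {
    "quartz": 4.0,
    "zinc sulfide": 82.0,
    "crown glass": 6.4,
    "flint glass": 23.0,
    "sodium chloride": 9.6,
}


def faraday_rotation(verdet, flux_density, length):
    """Rotation (rad) of the plane of polarization, eq. (3.18)."""
    return verdet * flux_density * length


def length_for_rotation(verdet, flux_density, angle_rad):
    """Path length (m) needed to reach a given rotation."""
    return angle_rad / (verdet * flux_density)


# Assumed operating point: B = 0.5 T, L = 50 mm
rotation = faraday_rotation(VERDET["flint glass"], 0.5, 0.05)
length_45 = length_for_rotation(VERDET["flint glass"], 0.5, math.pi / 4)
print(f"rotation = {math.degrees(rotation):.1f} deg, "
      f"length for 45 deg = {length_45 * 1e3:.1f} mm")
```

The second function gives the length needed for a 45° rotation, which is the setting required by the isolator described next.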

We can also express $\theta$ in terms of the refractive indices $n_r$ and $n_l$, that is

$\theta = \frac{\pi}{\lambda_0}(n_r - n_l)L$

A Faraday rotator used in conjunction with a pair of polarizers acts as an optical isolator 
which allows a light beam to travel through it in one direction but not in the opposite one. 
It may therefore be used in laser amplifying chains to eliminate reflected, backward-trav- 
elling waves, which are potentially damaging. The construction of a typical isolator is shown 
in Fig. 3.17. 

Light passing from left to right is polarized in the vertical plane by polarizer $P_1$. The Faraday
rotator is adjusted to produce a rotation of 45° in the clockwise sense. The second polarizer
$P_2$ is set at 45° to $P_1$ so that it will transmit light emerging from the rotator. However, a beam
entering from the right will be plane polarized at 45° to the vertical by $P_2$ and then have its
plane rotated by 45° in the clockwise sense by the rotator. It will therefore be incident on $P_1$
with its plane of polarization at right angles to the plane of transmission and be eliminated.
The device thus isolates the components on its left from light incident from the right.
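
The action of the isolator can be checked by following the plane of polarization through the device and applying Malus's law at the final polarizer. The sketch below is a minimal model under stated assumptions: ideal polarizers, light already polarized along the first polarizer it meets, and angles measured from the vertical in a fixed laboratory frame. The function names are hypothetical.

```python
import math


def malus(polarization_deg, axis_deg):
    """Fraction of irradiance passed by an ideal polarizer (Malus's law)."""
    return math.cos(math.radians(polarization_deg - axis_deg)) ** 2


def isolator_transmission(direction, rotation_deg=45.0, p1_deg=0.0, p2_deg=45.0):
    """Transmission through the polarizer / Faraday rotator / polarizer chain.

    The Faraday rotation has the same sense in the laboratory frame for both
    directions of travel, which is what makes the device non-reciprocal.
    """
    if direction == "forward":      # P1 -> rotator -> P2
        polarization = p1_deg + rotation_deg
        return malus(polarization, p2_deg)
    if direction == "backward":     # P2 -> rotator -> P1
        polarization = p2_deg + rotation_deg
        return malus(polarization, p1_deg)
    raise ValueError("direction must be 'forward' or 'backward'")


forward = isolator_transmission("forward")
backward = isolator_transmission("backward")
```

In this idealized model the forward beam is fully transmitted while the backward beam arrives at $P_1$ crossed and is blocked.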

FIG. 3.17 Optical isolator based on the Faraday effect. The reflected ray is shown displaced for clarity.

One potential application of magneto-optics currently receiving attention is large capacity
computer memories. Such memories must be capable of storing very large amounts of
information in a relatively small area and permit very rapid readout and, preferably, random
access. The usual magnetic memories have a number of limitations of size and reading speed.
Optical techniques can overcome both these constraints (ref. 3.4).

The magneto-optic memories developed so far are read via the Faraday effect or the mag- 
netic Kerr effect, which relates to the rotation of a beam of plane polarized light reflected 
from the surface of a material subjected to a magnetic field. In either case a magnetized ferro- 
or ferrimagnetic material rotates the plane of polarization of laser light incident on it. 

Writing may be achieved by heating the memory elements on the storage medium to a 
temperature above the Curie point using a laser beam. The element is then allowed to cool 
down in the presence of an external magnetic field thereby acquiring a magnetization in a 
given direction. Magnetizations of the elements in one direction may represent 'ones', in 
the opposite direction 'zeros'. To read the information the irradiance of the laser beam is 
reduced and then directed to the memory elements. The direction of the change in the polar- 
ization of the laser beam on passing through or being reflected from the memory elements 
depends on the directions of magnetization; therefore we can decide if a given element is 
storing a 'one' or 'zero'. 

Systems incorporating, for example, a 50 mW He-Ne laser with a Pockels modulator
and manganese bismuth (MnBi) thin film storage elements have enabled information to be
stored, read and erased at rates in excess of 1 megabit per second.



3.8 Acousto-optic effect

The acousto-optic effect is the change in the refractive index of a medium caused by the
mechanical strains accompanying the passage of a surface acoustic (strain) wave along the
medium. The strain and hence the refractive index varies periodically with a wavelength $\Lambda$
equal to that of the acoustic wave. The refractive index changes are caused by the
photoelastic effect which occurs in all materials on the application of a mechanical stress. It can
be shown that the change in refractive index is proportional to the square root of the total
acoustic power (ref. 3.5).

In general, the relationships between changes of refractive index and mechanical strain,
and between the strain and stress, are rather complicated (ref. 3.6). However, the change in
the refractive index can be visualized, as was the case for the electro-optic effect, as a change
in the size, shape and orientation of the index ellipsoid. For simplicity let us consider the
case of a monochromatic light wave, wavelength $\lambda$, travelling in a medium in which an
acoustic wave has produced sinusoidal variations of wavelength $\Lambda$ in the refractive index.
The situation is shown in Fig. 3.18, where the solid horizontal lines represent acoustic wave
peaks (pressure maxima) and the dashed horizontal lines represent acoustic wave troughs
(pressure minima). The portions of the wavefront near to a pressure peak will encounter a
higher refractive index and therefore advance with a lower velocity than those portions of
the wavefront which encounter pressure minima. The wavefront in the medium therefore
soon acquires the wavy appearance shown by the dashed curve in Fig. 3.18. The acoustic
wave velocity is very much less than the light wave velocity, so we may ignore it and
consider the variation in refractive index to be stationary in the medium.

As elements of the light wave propagate in a direction normal to the local wavefront, almost 
all the wave elements will suffer a change in direction leading to a redistribution of the light 
flux, which tends to concentrate near regions of compression. In effect, the acoustic wave 
sets up a diffraction grating within the medium so that optical energy is diffracted out of 
the incident beam into the various orders. There are two main cases of interest, namely (a) 
the Raman-Nath regime and (b) the Bragg regime.

FIG. 3.18 Schematic illustration of acousto-optic modulation. The acoustic waves change the refractive
index of the medium in a periodic way so that the plane optical wavefronts take on the 'wavy' appearance
shown (very much exaggerated) as they propagate through the medium.

In the Raman-Nath regime the width of the acoustic beam is so small that the diffracted
light suffers no further redistribution before leaving the modulator. The light is then diffracted
as though from a simple plane grating such that

$m\lambda = \Lambda \sin\theta_m$    (3.19)

where $m = 0, \pm 1, \pm 2, \ldots$ is the order and $\theta_m$ is the corresponding angle of diffraction, as
illustrated in Fig. 3.19.
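
The grating relation (3.19) can be evaluated directly. The sketch below lists the angles of the first few orders; for illustration it borrows the optical and acoustic wavelengths quoted in Example 3.4 and, as a simplifying assumption, ignores the difference between the vacuum wavelength and the wavelength in the medium. The function name is hypothetical.

```python
import math


def raman_nath_angles(wavelength, acoustic_wavelength, max_order):
    """Diffraction angles (rad) from m * lambda = Lambda * sin(theta_m), eq. (3.19)."""
    angles = {}
    for m in range(-max_order, max_order + 1):
        sine = m * wavelength / acoustic_wavelength
        if abs(sine) <= 1.0:  # orders with |sin(theta)| > 1 do not propagate
            angles[m] = math.asin(sine)
    return angles


angles = raman_nath_angles(wavelength=633e-9, acoustic_wavelength=4.3e-5, max_order=2)
for order, angle in sorted(angles.items()):
    print(f"m = {order:+d}: {angle * 1e3:7.2f} mrad")
```

Because $\lambda \ll \Lambda$, the orders are separated by only a few milliradians.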

The irradiance $I$ of the light in these orders depends on the 'ruling depth' of the acoustic
grating, which is related to the amplitude of the acoustic grating. This, in turn, is related to
the amplitude of the acoustic modulating wave (i.e. the stress produced). The fraction of light
removed from the zero-order beam is $\eta = (I_0 - I)/I_0$, where $I_0$ is the transmitted irradiance
in the absence of the acoustic wave. Thus amplitude variations of the acoustic wave are
transformed into irradiance variations of the optical beam.

FIG. 3.19 Geometry for Raman-Nath (or transmission-type) acousto-optic diffraction grating modulation.
The amount of light diffracted into the orders $m \geq 1$ from the incident beam, and hence the modulation of
the transmitted beam, depends on the amplitude of the modulating signal.

FIG. 3.20 Geometry for Bragg (or reflection-type) acousto-optic diffraction grating modulation: (a)
incident rays being scattered from successive layers - for constructive interference the path difference
AB + BC must equal an integral number of wavelengths $m\lambda$; (b) the amount of light 'reflected' into the first
order depends on the amplitude of the modulating signal (refraction of the light beam at the boundaries of
the acousto-optic crystal has been omitted for simplicity).

The physical basis of the Bragg regime is that light diffracted from the incident beam is
extensively rediffracted before leaving the acoustic field. Under these conditions, the acoustic
field acts very much like a 'thick' diffraction grating, that is a grating made up of planes
rather than lines. The situation is then very similar to that of Bragg diffraction (or
'reflection') of X-rays from planes of atoms in a crystal. Consider a plane wavefront incident on
the grating planes at an angle of incidence $\theta_i$ as shown in Fig. 3.20(a); significant amounts
of light will emerge only in those directions in which constructive interference occurs. The
conditions to be satisfied are: (a) light scattered from a given grating plane must arrive in
phase at the new wavefront and (b) light scattered from successive grating planes must also
arrive in phase at the new wavefront, implying that the path difference must be an integral
number of wavelengths. The first of these conditions is satisfied when $\theta_d = \theta_i$, where $\theta_d$ is
the angle of diffraction. The second condition requires that

$\sin\theta_i + \sin\theta_d = m\lambda/\Lambda$

with $m = 0, 1, 2, \ldots$. The two conditions are simultaneously fulfilled when

$\sin\theta_i = \sin\theta_d = \frac{m\lambda}{2\Lambda}$    (3.20)

The diffraction is similar to that obtained with a plane grating, but only for special angles
of incidence; the angle of incidence must equal the angle of diffraction.

Although in the simplified theory given above strong scattering can take place when $m$
is equal to any positive integer, a more rigorous treatment, taking into account the fact that
scattering is not from discrete planes but from a continuous medium, shows that scattering
only takes place when $m = 1$. This is shown in Fig. 3.20(b): the equation for the so-called
Bragg angle $\theta_B$ then becomes $\sin\theta_B = \lambda/2\Lambda$ (Problem 3.9). The modulation depth $(I_0 - I)/I_0$
(or diffraction efficiency, $\eta$) in this case can theoretically equal 100%, in contrast to about
34% for the Raman-Nath case. At the Bragg angle $\eta$ is given by

$\eta = \sin^2(\phi/2)$    (3.21)

where $\phi = (2\pi/\lambda)(\Delta n L/\cos\theta_B)$, in which $\Delta n$ is the amplitude of the refractive index
fluctuation, $L$ the length of the modulator and $\theta'_B$, the external Bragg angle, is related to $\theta_B$, the
internal Bragg angle, by $n \sin\theta_B = \sin\theta'_B$ (ref. 3.7).

The acoustic waves, which create the diffraction grating, are of course moving through
the medium and, as a consequence, the diffracted wave behaves as if it had been reflected
from a 'mirror' moving with the same velocity as the grating, and therefore appears to
originate from a source moving at twice the mirror (or grating) velocity. Thus the
frequency of the reflected beam is changed by the Doppler effect (see section 5.7) and is
given by

$\nu' = \nu_0[1 \pm 2v_x/(c/n)]$

where $\pm v_x$ is the component of the velocity of the acoustic wave along (or away from) the
original beam direction and $n$ is the medium refractive index. The frequency shift is thus

$\Delta\nu = \nu' - \nu_0 = \pm 2\nu_0 v_x n/c$

If the light is incident at an angle $\theta_i = \theta_d$ to the acoustic wave as shown in Fig. 3.20(a), then
$v_x = v_a \sin\theta_d$, where $v_a$ is the acoustic wave velocity. The frequency shift is then

$\Delta\nu = \pm 2\nu_0 v_a \sin\theta_d \, n/c$    (3.22)

Combining eqs (3.22) and (3.20) and taking $m = 1$ gives a frequency shift of $\pm v_a/\Lambda$ or $\pm f$,
where $f$ is the acoustic wave frequency. This change in frequency can be used as the basis
of a frequency modulator.
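
The statement that the Doppler shift equals the acoustic frequency can be verified numerically. The sketch below evaluates eq. (3.22) at the first-order Bragg angle of eq. (3.20); the acoustic data are those of Example 3.4, while the refractive index of 2.3 is an assumed, illustrative value (the result does not depend on it). The function name is hypothetical.

```python
C = 2.998e8  # speed of light in vacuum, m/s


def bragg_frequency_shift(vacuum_wavelength, n, acoustic_velocity, acoustic_wavelength):
    """Magnitude of the Doppler shift (Hz) from eq. (3.22) at the m = 1 Bragg angle."""
    nu0 = C / vacuum_wavelength
    wavelength_in_medium = vacuum_wavelength / n
    sin_theta = wavelength_in_medium / (2 * acoustic_wavelength)  # eq. (3.20), m = 1
    return 2 * nu0 * acoustic_velocity * sin_theta * n / C


shift = bragg_frequency_shift(633e-9, n=2.3, acoustic_velocity=3500.0,
                              acoustic_wavelength=4.3e-5)
acoustic_frequency = 3500.0 / 4.3e-5
print(f"shift = {shift / 1e6:.1f} MHz, acoustic frequency = {acoustic_frequency / 1e6:.1f} MHz")
```

The two numbers agree, since the optical quantities cancel when the Bragg condition is substituted.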

The minimum time required to move from a condition where the acoustic wave interacts
with the light beam and 'turns off' the undiffracted light to a condition where there is no
diffraction is the transit time of the acoustic wave across the optical beam. This is simply,
from Fig. 3.19, $t_{min} = B/v_a$, where $B$ is the optical beam width. Hence the bandwidth of the
modulator is limited to about $v_a/B$. Commercial modulators have bandwidths of up to
50 MHz. This limitation is partly due to the frequency dependence of the acoustic losses of
available acousto-optic materials. At the present time only LiNbO$_3$, PbMoO$_4$ and TeO$_2$ appear
to have sufficiently low loss to have a reasonable prospect of being operated at appreciably
higher frequencies.

Acousto-optic modulators can in general be used for similar applications to electro-optic 
modulators, though they are not so fast. On the other hand, because the electro-optic effect 
usually requires voltages in the kilovolt range, the drive circuitry for modulators based on 
this effect is much more expensive than for acousto-optic modulators, which operate with 
a few volts.



3.9 Quantum well modulators



We saw in section 2.4.3 that the presence of excitons with binding energies of a few meV
in a semiconductor results in the presence of energy levels in the forbidden energy gap, and
consequently the absorption of light with photon energies smaller than the bandgap energy
$E_g$. It was also mentioned that in quantum well structures the excitons are more strongly
bound, and that the associated absorption is observable at room temperature. If an electric
field is applied to a quantum well structure, the electron and hole wavefunctions are
separated by being pushed to opposite sides of the well, resulting in a reduction in the
absorption, and more importantly a shift of the absorption spectrum to lower energies, that is longer
wavelengths. This shift is much larger than in bulk material, and may be ~20 meV for
electric fields ~$10^7$ V m$^{-1}$ in a GaAs/GaAlAs quantum well 10 nm wide (ref. 3.8a). This shift,
referred to as the quantum confined Stark effect (QCSE), may be used as the basis of an
irradiance modulator. At zero field, therefore, light of wavelength slightly longer than the
bandgap wavelength $\lambda_g$ is fully transmitted, while it is strongly attenuated when the field is
applied. This is an example of electroabsorption.

An efficient QCSE modulator may comprise a multiple quantum well (MQW) structure, 
that is a sequence of perhaps 40 or more quantum wells, built into a p-i(MQW)-n diode 
as shown in Fig. 3.21. The MQWs consist of layers of GaAs 9-10 nm thick separated by 
layers of AlGaAs of similar thickness. The diode structure is fabricated on a GaAs substrate, 
which is absorbing at the wavelength of operation so that a window is usually etched into 
it as shown. Similar devices can be constructed from InGaAs/InAlAs MQWs grown on InP
substrates. 

FIG. 3.21 Diagram of a vertical geometry p-i(MQW)-n QCSE modulator.

FIG. 3.22 Schematic illustration of a guided wave MQW QCSE modulator.

In contrast to the vertical geometry shown in Fig. 3.21 it is possible to construct the QCSE
devices for integrated optical applications (see section 9.4). Here the light to be modulated
enters one end of a guided wave structure as shown in Fig. 3.22. The device may be some
200 μm long, in contrast to the 1-2 μm height of the structure in Fig. 3.21, so that
comparatively few quantum wells are required to achieve the desired electroabsorption; such
devices are also called travelling wave modulators.

The high frequency performance of QCSE modulators depends on what might be regarded 
as intrinsic factors such as the times that electrons and holes remain in the quantum wells, 
and extrinsic factors such as the RC time constant as in any junction diode. With care mod- 
ulation bandwidths greater than 20 GHz can be achieved. 



EXAMPLE 3.4 Acousto-optic modulator 



Given the following data for a PbMoO$_4$ acousto-optic modulator, we may calculate the
Bragg angle, the maximum change in refractive index of the material and the maximum
width of the optical beam of wavelength $\lambda$ = 633 nm that may be modulated with a
bandwidth of 5 MHz.

The modulator length is 50 mm, diffraction efficiency 70%, while the acoustic
wavelength is 4.3 × 10$^{-5}$ m and the acoustic velocity is 3500 m s$^{-1}$.

The angle of diffraction (from eq. 3.20) is

$\theta_B = \sin^{-1}\left(\frac{633 \times 10^{-9}}{2 \times 4.3 \times 10^{-5}}\right)$ = 7.4 mrad (or 0.42°)

The value of $\phi$ is given by eq. (3.21)

$\phi = 2\sin^{-1}\sqrt{\eta} = 2\sin^{-1}\sqrt{0.7}$

$\phi$ = 113.6° (or 1.98 rad)

Therefore, with $\phi$ expressed in radians,

$\Delta n = \frac{\phi\lambda\cos\theta_B}{2\pi L} \approx 4.0 \times 10^{-6}$

The bandwidth is $v_a/B$ and hence the maximum optical beam width $B$ is

$B = \frac{3500}{5 \times 10^6}$ m = 0.7 mm
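
The steps of this example can be reproduced with a few lines of code. The sketch below uses only the data quoted in the example together with eqs (3.20) and (3.21); the variable names are hypothetical, and $\phi$ is handled in radians throughout.

```python
import math

# Data from Example 3.4
wavelength = 633e-9           # optical wavelength, m
length = 50e-3                # modulator length, m
efficiency = 0.70             # diffraction efficiency
acoustic_wavelength = 4.3e-5  # m
acoustic_velocity = 3500.0    # m/s
bandwidth = 5e6               # Hz

# Bragg angle from eq. (3.20) with m = 1
theta_b = math.asin(wavelength / (2 * acoustic_wavelength))

# Invert eta = sin^2(phi / 2), eq. (3.21)
phi = 2 * math.asin(math.sqrt(efficiency))

# Invert phi = (2 pi / lambda)(delta_n L / cos(theta_B))
delta_n = phi * wavelength * math.cos(theta_b) / (2 * math.pi * length)

# Bandwidth is approximately v_a / B
beam_width = acoustic_velocity / bandwidth

print(f"theta_B = {theta_b * 1e3:.2f} mrad, phi = {math.degrees(phi):.1f} deg")
print(f"delta_n = {delta_n:.2e}, beam width = {beam_width * 1e3:.2f} mm")
```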



3.10 Non-linear optics

Practical applications of non-linear optical effects have arisen as a direct consequence of 
the invention of the laser. The very high power densities made available by lasers have 
enabled several phenomena, which were previously regarded as theoretical curiosities, to 
be observed and exploited. 

The explanation of non-linear effects lies in the way in which a beam of light propagates 
through a solid. The nuclei and associated electrons of the atoms in the solid form electric 
dipoles. The electromagnetic radiation interacts with these dipoles causing them to oscil- 
late which, by the classical laws of electromagnetism, results in the dipoles themselves acting 
as sources of electromagnetic radiation. If the amplitude of vibration is small, the dipoles 
emit radiation of the same frequency as the incident radiation. As the irradiance of the radi- 
ation increases, however, the relationship between irradiance and amplitude of vibration 
becomes non-linear resulting in the generation of harmonics of the frequency of the radia- 
tion emitted by the oscillating dipoles. Thus frequency doubling or second-harmonic gen- 
eration and indeed higher order frequency effects occur as the incident irradiance is increased. 
The electric polarization (or dipole moment per unit volume) $P$ can be expressed as a power
series expansion in the applied electric field $\mathcal{E}$ by

$P = \varepsilon_0(\chi\mathcal{E} + \chi_2\mathcal{E}^2 + \chi_3\mathcal{E}^3 + \cdots)$    (3.23)

where $\chi$ is the linear susceptibility and $\chi_2$, $\chi_3$, ... are the non-linear optical coefficients. The
resulting (non-linear) relationship between $P$ and $\mathcal{E}$ is shown in Fig. 3.23.



FIG. 3.23 Curve showing electric polarization $P$ versus electric field $\mathcal{E}$ for a non-linear material (i.e. a material
lacking a centre of symmetry), assuming $\chi_2$ is negative.






If the applied field is of the form $\mathcal{E} = \mathcal{E}_0 \sin\omega t$ such as produced by an electromagnetic
wave, then substitution into eq. (3.23) gives

$P = \varepsilon_0(\chi\mathcal{E}_0 \sin\omega t + \chi_2\mathcal{E}_0^2 \sin^2\omega t + \chi_3\mathcal{E}_0^3 \sin^3\omega t + \cdots)$

$\;\; = \varepsilon_0[\chi\mathcal{E}_0 \sin\omega t + \tfrac{1}{2}\chi_2\mathcal{E}_0^2(1 - \cos 2\omega t) + \cdots]$    (3.24)

Equation (3.24) contains a term in $2\omega$ which corresponds to an electromagnetic wave having
twice the frequency of the incident wave. The magnitude of the term in $2\omega$, however, does not
approach that of the first term $\varepsilon_0\chi\mathcal{E}$ until the electric field is about $10^6$ V m$^{-1}$ (which is not
entirely negligible in comparison with the internal fields of crystals, i.e. $\mathcal{E}_{int} \sim 10^{11}$ V m$^{-1}$). A
field of $10^6$ V m$^{-1}$ corresponds, at optical wavelengths, to a power density of about $10^9$ W m$^{-2}$
while the electric fields and power density of sunlight are of the order of 100 V m$^{-1}$ and
20 W m$^{-2}$ respectively, so it is not too surprising that the observation of non-linear effects had
to await the advent of the laser. (Large non-linear effects have been observed in some
semiconductors with power densities of only about 5 × $10^4$ W m$^{-2}$ due to free carrier effects.)
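
The link between field amplitude and power density can be checked with the standard plane-wave relation $I = \tfrac{1}{2} n \varepsilon_0 c \mathcal{E}_0^2$. The sketch below is a minimal illustration that assumes a plane wave and treats the quoted fields as amplitudes in vacuum ($n = 1$); the function name is hypothetical.

```python
EPS0 = 8.854e-12  # permittivity of free space, F/m
C = 2.998e8       # speed of light in vacuum, m/s


def power_density(field_amplitude, n=1.0):
    """Time-averaged irradiance (W m^-2) of a plane wave of given field amplitude (V/m)."""
    return 0.5 * n * EPS0 * C * field_amplitude ** 2


laser_like = power_density(1.0e6)   # field at which non-linear terms become significant
sunlight_like = power_density(100.0)
print(f"1e6 V/m -> {laser_like:.2e} W/m^2, 100 V/m -> {sunlight_like:.1f} W/m^2")
```

The eight orders of magnitude between the two results reflect the quadratic dependence of irradiance on field.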
FIG. 3.24 Applied sinusoidal optical (i.e. electrical) field and the resulting polarization for a non-linear
material (a) and Fourier analysis of the asymmetrical polarization wave (b) into (i) a fundamental wave
oscillating at the same angular frequency ($\omega$) as the wave inducing it, (ii) a second harmonic of twice that
frequency ($2\omega$) and (iii) an average (d.c.) negative component.

Harmonic generation is only observed in those solids that do not possess a centre of
symmetry. In symmetric materials, an applied electric field produces polarizations of the same
magnitude but of opposite sign according to whether the electric field is positive or negative and
there is no net polarization. Consequently the coefficients of even powers of $\mathcal{E}$ in eq. (3.24)
are zero. In anisotropic media such as quartz, ADP and KDP, however, harmonics are
generated as indicated in Fig. 3.24(a) where we see that the symmetrical optical field produces an
asymmetrical polarization. A Fourier analysis of the polarization (Fig. 3.24b) shows that it
consists of components having frequencies $\omega$ and $2\omega$ as well as a d.c. component.
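
The Fourier decomposition described above can be reproduced numerically. The sketch below samples $P/\varepsilon_0 = \chi\mathcal{E} + \chi_2\mathcal{E}^2$ over one optical period and projects it onto a constant, $\sin\omega t$ and $\cos 2\omega t$. The susceptibility values are arbitrary, assumed numbers chosen only to make the asymmetry visible, and the function name is hypothetical.

```python
import math


def polarization_components(chi, chi2, e0, samples=2000):
    """Return (d.c., fundamental, second-harmonic) amplitudes of P / eps0.

    P / eps0 = chi * E + chi2 * E**2 with E = e0 * sin(wt), truncated after
    the quadratic term of eq. (3.23).
    """
    dc = fundamental = second = 0.0
    for k in range(samples):
        wt = 2 * math.pi * k / samples
        field = e0 * math.sin(wt)
        p = chi * field + chi2 * field ** 2
        dc += p / samples
        fundamental += 2 * p * math.sin(wt) / samples
        second += 2 * p * math.cos(2 * wt) / samples
    return dc, fundamental, second


# Assumed, illustrative values with chi2 negative as in Fig. 3.23
dc, fundamental, second = polarization_components(chi=1.0, chi2=-0.2, e0=1.0)
```

The projections agree with eq. (3.24): the fundamental has amplitude $\chi\mathcal{E}_0$, while the d.c. and $\cos 2\omega t$ terms have amplitudes $\tfrac{1}{2}\chi_2\mathcal{E}_0^2$ and $-\tfrac{1}{2}\chi_2\mathcal{E}_0^2$ respectively, so a negative $\chi_2$ gives a negative d.c. component.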

Second-harmonic generation was first observed in 1961 by Franken and his co-workers
(ref. 3.9), who focused the 694.3 nm output from a ruby laser onto a quartz crystal as shown
in Fig. 3.25 and obtained a very low intensity output at a wavelength of 347.15 nm. In these
experiments the conversion efficiency from the lower to the higher frequency was typically
$10^{-6}$% to $10^{-4}$%. The reason for this is that wavelength dispersion within the crystal causes
the frequency-doubled light to travel at a different speed from that of the fundamental. As
the latter is generating the former throughout its passage through the crystal, the two waves
periodically get out of phase and destructive interference occurs. The irradiance of the
frequency-doubled light thus undergoes fluctuations through the crystal with a periodicity of
$l_c$, which is called the coherence length and is typically only a few micrometres.

FIG. 3.25 Simplified diagram showing the arrangement for optical frequency doubling. The KDP crystal is
mounted in the correct orientation for index matching. [Figure labels: incident light from ruby laser
($\lambda$ = 694.3 nm); crystal; output at $\omega$ ($\lambda$ = 694.3 nm) and $2\omega$ ($\lambda$ = 347.2 nm).]

We can derive an expression for $l_c$ as follows. Let us consider a plane wave propagating
through an anisotropic crystal; the fundamental wave has a space-time variation of the form
$\exp[i(k_1 z - \omega t)]$, whereas that of the second harmonic is $\exp[i(k_2 z - 2\omega t)]$. The amplitude of
the second harmonic as it emerges from the crystal can be found by summing the
contributions for the conversion which occurs in each element $dz$ within the crystal, that is

$\mathcal{E}(2\omega, L) \propto \int_0^L \mathcal{E}^2(\omega, z)\,dz$    (3.25)

where $L$ is the thickness of the crystal. If we let the time taken for the optical disturbance
of frequency $2\omega$ to travel from each point $z$ to $L$ be $\tau$, then we can write relation (3.25) as

$\mathcal{E}(2\omega, L) \propto \int_0^L \exp\{2i[k_1 z - \omega(t - \tau)]\}\,dz$

where

$\tau = \frac{L - z}{c_{2\omega}} = \frac{(L - z)k_2}{2\omega}$

$c_{2\omega}$ being the speed of the second harmonic and $k_2$ being the corresponding wavevector.
Substituting for $\tau$ we have

$\mathcal{E}(2\omega, L) \propto \int_0^L \exp\left\{2i\left[\left(k_1 - \frac{k_2}{2}\right)z + \frac{k_2 L}{2} - \omega t\right]\right\}dz$

Integrating and squaring we find that the irradiance of the second harmonic is proportional
to

$|\mathcal{E}(2\omega, L)|^2 \propto \left\{\frac{\sin[(k_1 - k_2/2)L]}{(k_1 - k_2/2)}\right\}^2$    (3.26)

Relation (3.26) indicates that the irradiance of the second harmonic reaches a maximum after
the waves have propagated a distance $L = l_c = \pi/(2k_1 - k_2)$ into the crystal. Thereafter, the
energy in the second harmonic is returned to the fundamental wave and after two or indeed
any even number of coherence lengths, the irradiance of the second harmonic falls to zero.
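
The growth and decay described by relation (3.26) can be illustrated numerically. The sketch below takes the KDP indices quoted in Example 3.6 and evaluates the relative second-harmonic irradiance at a few crystal thicknesses. Writing $k_1 = 2\pi n^{\omega}/\lambda_0$ and $k_2 = 4\pi n^{2\omega}/\lambda_0$ follows from the definitions in the text; the function names are hypothetical and the absolute value in the coherence length is used only so that the result is positive whichever index is the larger.

```python
import math


def coherence_length(vacuum_wavelength, n_fund, n_second):
    """Coherence length l_c = pi / |2 k1 - k2| = lambda0 / (4 |n_fund - n_second|)."""
    return vacuum_wavelength / (4 * abs(n_fund - n_second))


def relative_shg_irradiance(length, vacuum_wavelength, n_fund, n_second):
    """Right-hand side of relation (3.26), in arbitrary units."""
    k1 = 2 * math.pi * n_fund / vacuum_wavelength
    k2 = 4 * math.pi * n_second / vacuum_wavelength
    mismatch = k1 - k2 / 2
    if mismatch == 0:
        return length ** 2  # perfectly index matched limit
    return (math.sin(mismatch * length) / mismatch) ** 2


# KDP data from Example 3.6
lam0, n_w, n_2w = 0.8e-6, 1.5019, 1.4802
l_c = coherence_length(lam0, n_w, n_2w)
peak = relative_shg_irradiance(l_c, lam0, n_w, n_2w)
for multiple in (0.5, 1.0, 1.5, 2.0):
    value = relative_shg_irradiance(multiple * l_c, lam0, n_w, n_2w)
    print(f"L = {multiple:.1f} l_c: relative irradiance = {value / peak:.3f}")
```

The output rises to its maximum at $L = l_c$ and returns to zero at $L = 2l_c$, as stated above.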
This difficulty can be overcome by a technique known as index or phase matching. The
commonest method uses the birefringent properties of the non-linear medium, which of
course must be anisotropic if harmonic generation is to occur at all. As we have illustrated
in Fig. 3.26, which shows the normal or index surfaces for $n_o^{\omega}$ and $n_e^{2\omega}$ (section 3.2), it is
possible to choose a direction through the crystal such that the velocity of the fundamental
(corresponding to the O-ray of frequency $\omega$ and refractive index $n_o^{\omega}$) is the same as that of the
second harmonic (corresponding to the E-ray of frequency $2\omega$ and refractive index $n_e^{2\omega}(\theta)$);
that is, we choose a direction such that $n_o^{\omega} = n_e^{2\omega}(\theta)$. With this technique, known as index
matching, the coherence length increases from a few micrometres to a few centimetres. The
conversion efficiency can be increased by orders of magnitude as the fundamental and
harmonic waves remain in phase and there is a continuous build-up in the irradiance of the
second harmonic. The conversion may be about 20% for a single pass through a KDP crystal
a few centimetres long. However, care must be taken in a number of respects to maximize
the efficiency. For example, as the refractive indices are temperature dependent it may be
necessary to control the crystal temperature. Furthermore, as only one direction of
propagation is perfectly index matched, the laser beam divergence must be minimized and
similarly lasers with broad linewidths may have lower conversion efficiencies.

FIG. 3.26 Index matching in a negative uniaxial crystal. The condition $n_e^{2\omega}(\theta_m) = n_o^{\omega}$ is satisfied for propagation
at an angle $\theta_m$ to the optic axis ($\theta_m$ = 50° in KDP). The eccentricities and velocity differences are greatly
exaggerated for clarity - similarly the E-ray index surface at frequency $\omega$ and the O-ray index surface at
frequency $2\omega$ have been omitted.



EXAMPLE 3.5 Phase matching angle in ADP 



Given the following information for ADP determine the angle $\theta_m$. The appropriate
refractive indices are

$\lambda_0$ (μm)    $n_o$     $n_e$
1.06         1.4943    1.4603
0.53         1.5132    1.4712

The phase matching angle is given by (see Problem 3.10)

$\sin^2\theta_m = \frac{(n_o^{\omega})^{-2} - (n_o^{2\omega})^{-2}}{(n_e^{2\omega})^{-2} - (n_o^{2\omega})^{-2}}$

Then substituting the numerical values gives

$\sin^2\theta_m = \frac{0.4478 - 0.4367}{0.4620 - 0.4367} = 0.44$

so that $\sin\theta_m = 0.66$ and hence $\theta_m \approx 42°$.
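
The phase matching expression can be evaluated, and its result checked, with a short calculation. The sketch below uses the ADP indices tabulated in the example; the check relies on the standard index-surface relation for the extraordinary ray in a uniaxial crystal, $1/n_e^2(\theta) = \cos^2\theta/n_o^2 + \sin^2\theta/n_e^2$, and the function names are hypothetical.

```python
import math


def phase_matching_angle(n_o_fund, n_o_second, n_e_second):
    """Angle (rad) to the optic axis at which n_e(2w, theta) equals n_o(w)."""
    numerator = n_o_fund ** -2 - n_o_second ** -2
    denominator = n_e_second ** -2 - n_o_second ** -2
    return math.asin(math.sqrt(numerator / denominator))


def extraordinary_index(theta, n_o, n_e):
    """Index seen by the E-ray propagating at angle theta to the optic axis."""
    inverse_square = math.cos(theta) ** 2 / n_o ** 2 + math.sin(theta) ** 2 / n_e ** 2
    return 1.0 / math.sqrt(inverse_square)


# ADP data from Example 3.5
theta_m = phase_matching_angle(n_o_fund=1.4943, n_o_second=1.5132, n_e_second=1.4712)
n_e_at_theta = extraordinary_index(theta_m, n_o=1.5132, n_e=1.4712)
print(f"theta_m = {math.degrees(theta_m):.1f} deg, n_e(2w, theta_m) = {n_e_at_theta:.4f}")
```

At the computed angle the extraordinary index at $2\omega$ reproduces the ordinary index at $\omega$, which is the index matching condition.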



Second-harmonic generation enables us to extend the range of laser wavelengths into the 
blue and ultraviolet parts of the spectrum, which are not rich in naturally occurring laser lines. 
One very important laser application which may benefit from second-harmonic generation 
is laser-induced nuclear fusion, which appears to be more efficient at higher optical fre- 
quencies (see section 6.8). 

Non-linear processes can be described in terms of a photon model, according to which
we can view second-harmonic generation as the annihilation of two photons of angular
frequency $\omega$ and the simultaneous creation of one photon of frequency $2\omega$. That is,
conservation of energy requires that

$\hbar\omega + \hbar\omega = \hbar(2\omega)$    (3.27)

while conservation of momentum for the photons similarly requires that

$\hbar k^{\omega} + \hbar k^{\omega} = \hbar k^{2\omega}$

or

$2k^{\omega} = k^{2\omega}$    (3.28)

It is left to the reader to show that this equation is equivalent to the above refractive index 
matching criterion. The photon model forms a useful basis for the discussion of a related 
non-linear phenomenon - that of parametric amplification and oscillation. 

3.10.1 Parametric oscillation 

Second-harmonic generation can be regarded as a special case of sum frequency
conversion, whereby power from a 'pump' wave at angular frequency $\omega_3$ is transferred to waves
at frequencies $\omega_1$ and $\omega_2$; that is,

$\omega_3 \rightleftharpoons \omega_1 + \omega_2$    (3.29)

The 'reaction' represented by relation (3.29) goes to the left in odd-numbered coherence
lengths and to the right in even-numbered ones. If the frequency $\omega_3$ alone is applied to a
suitable non-linear material such as lithium niobate, then the two smaller frequencies can build
up from noise; $\omega_1$ and $\omega_2$ are known as the 'signal' and 'idler' frequencies respectively. The
particular subdivision into $\omega_1$ and $\omega_2$ is determined by the index matching criterion, which
conservation of momentum gives as

$\hbar k_1 + \hbar k_2 = \hbar k_3$    (3.30)



EXAMPLE 3.6 Coherence length in second-harmonic generation 

Given that $n^{\omega}$ at 0.8 µm is 1.5019 and $n^{2\omega}$ at 0.4 µm is 1.4802 in KDP, we can 
calculate the coherence length as follows: the coherence length is given by 
$l_c = \pi/(2k_1 - k_2)$, which we can show can be written as 

$l_c = \dfrac{\lambda_0}{4(n^{\omega} - n^{2\omega})}$ 

where $\lambda_0$ is the vacuum wavelength at the fundamental frequency. Therefore, using 
the data provided 

$l_c = \dfrac{0.8 \times 10^{-6}}{4 \times 0.0217} \approx 10^{-5}$ m 
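
The arithmetic of Example 3.6 can be checked with a few lines of Python. This is only a sketch: the function name `coherence_length` and its arguments are illustrative choices, and the absolute value of the index difference is taken so that the result is positive whichever index is larger.

```python
def coherence_length(vacuum_wavelength_m, n_fundamental, n_second_harmonic):
    """Coherence length l_c = lambda_0 / (4 |n^omega - n^2omega|), in metres."""
    delta_n = abs(n_fundamental - n_second_harmonic)
    return vacuum_wavelength_m / (4.0 * delta_n)


# Data quoted in Example 3.6 for KDP.
l_c = coherence_length(0.8e-6, 1.5019, 1.4802)
print(f"coherence length = {l_c:.2e} m")
```

With the quoted indices the result is a little over 9 µm, which is the value rounded to $10^{-5}$ m in the example.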



It can be seen from relation (3.29) that if $\omega_3$ and $\omega_1$ are fixed, then $\omega_2$ is also fixed 
($\omega_2 = |\omega_3 - \omega_1|$). If, however, only frequency $\omega_3$ is fixed, then the other two are free to 
range over many values; this effect is known as parametric amplification. (The term parametric 
derives from electronic engineering, where so-called parametric amplifiers were developed 
in the 1950s. The interaction between the signals at different frequencies was achieved by 
varying some circuit 'parameter', typically the capacitance.) 

FIG. 3.27 (a) The optical parametric amplifier in which frequencies $\omega_1$ and $\omega_2$ are generated by the pump 
waves, frequency $\omega_3$. In (b) the non-linear crystal is placed within an optical cavity resonant at frequency 
$\omega_1$, so that oscillations at frequency $\omega_1$ are generated. 

If the material producing parametric amplification is placed in an optical resonator (Fig. 
3.27), such as a Fabry-Perot cavity (see sections 1.2.3 and 5.5), tuned to the signal, idler or 
both frequencies, then coherent parametric oscillations will occur (see section 5.10). On the 
other hand, if energy flows into the wave of frequency $\omega_3$ from the signal and 
idler waves, the process is called sum or difference frequency generation, and as mentioned 
above second-harmonic generation, or frequency doubling, is a special case of sum frequency 
generation. 

The index matching criterion (eq. 3.30) is very severe and the process is usually carried 
out in an optical cavity (Chapter 5) with mirrors that are highly reflecting at $\omega_1$ or $\omega_2$ but 
not at $\omega_3$. Tuning can be achieved simply by varying the index matching conditions through, 
for example, mechanical or temperature control of the length of the cavity. A schematic 
diagram of the system used by Giordmaine and Miller (ref. 3.10), who first achieved 
parametric oscillation in 1965, is shown in Fig. 3.28. In this case, the output was tuned by 
changing the temperature of the lithium niobate crystal. A temperature change of about 11°C 
produced output frequencies in the range of $3.1 \times 10^{14}$ to $2.6 \times 10^{14}$ Hz (which corresponds 
to the wavelength range 968-1154 nm). While the conversion efficiency in these 
experiments was only 1%, efficiencies of about 50% have now been achieved, with coherent 
wavelengths ranging from the infrared to the ultraviolet being produced. 
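
The frequency range quoted above can be converted to wavelengths, and relation (3.29) used to find an idler frequency, with the short sketch below. The helper names are illustrative, and the pump frequency of $5.7 \times 10^{14}$ Hz is an assumed value chosen only so that the two ends of the quoted range form a signal/idler pair; it is not a figure taken from the text.

```python
C = 2.998e8  # speed of light in vacuum, m/s


def wavelength_nm(frequency_hz):
    """Vacuum wavelength in nanometres for a given optical frequency."""
    return C / frequency_hz * 1e9


def idler_frequency(pump_hz, signal_hz):
    """Idler frequency from energy conservation, omega_2 = |omega_3 - omega_1|."""
    return abs(pump_hz - signal_hz)


# Ends of the tuning range quoted for the lithium niobate oscillator.
short_nm = wavelength_nm(3.1e14)
long_nm = wavelength_nm(2.6e14)
print(f"tuning range: {short_nm:.0f} nm to {long_nm:.0f} nm")

# Assumed pump frequency (illustrative only).
pump_hz = 5.7e14
idler_hz = idler_frequency(pump_hz, 3.1e14)
print(f"idler frequency: {idler_hz:.2e} Hz")
```

The computed wavelengths agree with the quoted 968-1154 nm range to within about a nanometre; the small difference comes from rounding of the frequencies.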

The additional laser wavelengths generated via parametric processes are useful in extend- 
ing the range of wavelengths available from the various lasers described in section 5.10. The 
tunability of the wavelength generated is a particularly useful feature in many applications 
such as photochemistry and the study of the kinetics of ultrafast chemical reactions (see the 
references on pp. 511-13 of ref. 6.4d). 

FIG. 3.28 Arrangement of the components in Giordmaine and Miller's observation of parametric 
oscillation in lithium niobate. (Schematic, in order along the beam: light from an Nd:YAG laser, 
$\lambda$ = 1.06 µm; lithium niobate crystal; output at $\lambda$ = 529 nm and 1.06 µm; IR filter; 
$\lambda$ = 529 nm; lithium niobate crystal with coated end faces to form a resonant cavity, held in a 
temperature controlled oven; tuned output.) 



PROBLEMS 

3.1 Prove that the angle of inclination of the axes of an ellipse (which represents 
elliptically polarized light at a fixed position in space) to the horizontal and vertical 
directions is given by $\tfrac{1}{2}\tan^{-1}[(2\mathcal{E}_0\mathcal{E}_0'\cos\phi)/(\mathcal{E}_0^2 - \mathcal{E}_0'^2)]$ (the symbols as defined in section 3.1). 

3.2 The principal refractive indices for quartz, $n_e$ and $n_o$, are 1.55336 and 1.54425 
respectively; calculate the thickness of a quarter-wave plate for sodium D light ($\lambda_0$ = 589.3 nm). 
How may such a plate be made more robust? Calculate the thickness of a calcite $\lambda/2$ 
plate for which $n_o$ = 1.658 and $n_e$ = 1.486 at the same wavelength. 

3.3 Show how a beam of plane polarized light may be considered as consisting of two 
oppositely directed, circularly polarized beams of light. Hence show that the rotation of the 
plane of polarization in an optically active medium of thickness $d$ is $(\pi d/\lambda_0)(n_r - n_l)$. 
The specific rotation of quartz is 29.73° mm$^{-1}$ at $\lambda_0$ = 508.6 nm; calculate the 
difference in refractive indices ($n_r - n_l$). 

3.4 Using the data given in Table 3.1 calculate some typical half-wave voltages. Why would 
you expect the half-wave voltage to vary with wavelength? 

3.5 Calculate the half-wave or switching voltage for a transverse field Pockels modulator 
using lithium niobate at a wavelength of 632.8 nm where the value of the appropriate 
electro-optic coefficient is 6.8 pm V$^{-1}$. 

3.6 Suggest the design of a double-prism KD*P beam deflector to give a beam deflection 
of 2° (the voltage applied to the crystal should not exceed 20 kV). 

3.7 Using the data provided in Table 3.1 calculate the modulation power required for a 
longitudinal GaAs modulator to give a retardation of $\pi$. Take the modulation bandwidth 
to be 0.1 MHz, the length of the crystal to be 30 mm and the cross-sectional area to be 
100 mm$^2$. Verify that the same power would be required if the crystal were used in the 
transverse mode. 

3.8 Design an optical isolator using zinc sulfide (see Table 3.3); take the permeability of 
zinc sulfide to be unity and assume that the magnetic field is produced by a solenoid 
wound directly onto the zinc sulfide at the rate of 5 turns per mm. 

3.9 Show that, assuming a Bragg acousto-optic modulator is in the form of a parallel-sided 
block with the acoustic wave travelling perpendicularly to these faces (Fig. 3.20b), 
the external ray deflection angle is given by $\sin\theta_B' = \lambda_0/2\Lambda$. Hence calculate the beam 
deflection in a silica modulator, where the acoustic wave velocity is $3.76 \times 10^3$ m s$^{-1}$ 
and the frequency of the acoustic wave is 50 MHz, for light of 1.06 µm wavelength. 

3.10 Show that the phase matching angle $\theta_m$ in a non-linear optical medium is given by 

$\sin^2\theta_m = \dfrac{(n_o^{\omega})^{-2} - (n_o^{2\omega})^{-2}}{(n_e^{2\omega})^{-2} - (n_o^{2\omega})^{-2}}$ 

(Hint: from Fig. 3.26 it can be seen that $\theta_m$ is the angle at which the O-ray velocity 
at frequency $\omega$ equals the E-ray velocity at frequency $2\omega$, i.e. it is the angle at which 
the ellipse and circle in the figure intersect; the value of the extraordinary refractive index 
as a function of $\theta$ is given in Appendix 2.) 

Display devices 



We may divide display devices into two broad categories: (a) those that emit their own radi- 
ation (active devices) and (b) those that in some way modulate incident radiation to provide 
the display information (passive devices). However, before discussing the devices them- 
selves, we consider the circumstances under which matter can be induced to emit radiation. 



4.1 Luminescence 

Luminescence is the general term used to describe the emission of radiation from a solid 
when it is supplied with some form of energy. We may distinguish between the various types 
of luminescence by the method of excitation. For example: 

Photoluminescence: excitation arises from the absorption of photons 
Cathodoluminescence: excitation is by bombardment with a beam of electrons 
Electroluminescence: excitation results from the application of an electric field (which 
may be either a.c. or d.c.) 

Whatever the form of energy input to the luminescing material, the final stage in the process 
is an electronic transition between two energy levels, $E_1$ and $E_2$ ($E_2 > E_1$), with the 
emission of radiation of wavelength $\lambda$ where (see section 1.3) 

$\dfrac{hc}{\lambda} = E_2 - E_1$ (4.1) 

Invariably $E_1$ and $E_2$ are part of two groups of energy levels, so that, instead of a single 
emission wavelength, a band of wavelengths is usually observed. 

When the excitation mechanism is switched off, we would expect the luminescence to 
persist for a time equal to the lifetime of the transition between the two energy levels $E_1$ and 
$E_2$. When this is so, we speak of fluorescence. Often, however, the luminescence persists for 
much longer than expected, a phenomenon called phosphorescence. Phosphorescence is often 
attributable to the presence of metastable (or very long lifetime) states with energies less than 
$E_2$. Electrons can fall into these states and remain trapped there until thermal excitation releases 
them some time later. Materials exhibiting phosphorescence are known as phosphors. 
Generally speaking, phosphor materials depend for their action on the presence within the 
material of impurity ions called activators. These replace certain of the host ions in the crystal 
lattice. Unless the charge on the activator ion is identical with that of the host ion it replaces, 
the charge balance will be upset and few activator ions will be able to enter the lattice. Improved 
solubility of the activator in these circumstances may result from the introduction of further 
impurity atoms with different ionic charge. These are known as co-activators. 

We may distinguish between two main types of energy level system. In the first the energy 
levels are those of the activator ion itself, whilst in the second they are those of the host lattice 
modified by the presence of the activator ions. We refer to these two types as 
'characteristic' and 'non-characteristic' respectively. 

In characteristic luminescence, the excitation energy is usually transferred rapidly (i.e. in 
a time very much less than $10^{-8}$ s) to the activator ion. The persistence of the luminescence 
is then entirely due to the lifetime of the excited state level of the activator. It should be noted 
that, whilst for atomic electric dipole transitions this is of the order of $10^{-8}$ s, it can be much 
longer if such transitions are forbidden (see ref. 4.1). Hence fluorescence cannot be 
unambiguously associated with characteristic luminescence. 

In non-characteristic luminescent materials, both activators and co-activators are usually 
present. These create acceptor and donor energy levels in the material (see section 2.4.2), 
although in phosphors these levels are usually referred to as hole and electron traps 
respectively. Energy absorption within the solid creates excess electron-hole pairs and, as the hole 
trapping probability is usually much greater than the electron trapping probability, most of 
the excess holes quickly become trapped. Any electron that then finds itself in the vicinity 
of a trapped hole can recombine with it and generate luminescence. As the electrons migrate 
through the crystal, however, they themselves are subject to trapping. The electron traps could, 
of course, act as recombination centres were there appreciable numbers of free holes present, 
but the difference in trapping probability prevents this. Instead an electron may remain in its 
trap for some time before subsequently being released by thermal excitation. It may then go 
on to be retrapped or to recombine with a trapped hole. This process is illustrated in Fig. 4.1. 
The time that an electron spends in a trap depends on the depth of the trap below the 
conduction band ($E_c - E_d$) and also the temperature $T$. It is generally found that the 
probability of escape per unit time can be written in the form $Q\exp[-(E_c - E_d)/kT]$ where $Q$ is 
a constant approximately equal to $10^8$ s$^{-1}$ (ref. 4.2). Thus with fairly 'deep' traps (i.e. large 
$E_c - E_d$) and at low temperatures the time spent in a trap may be comparatively large, which 
will cause a long persistence of the luminescence after the cessation of excitation. In some 
cases this may amount to hours or even days. 

FIG. 4.1 Electron-hole generation and recombination processes in non-characteristic luminescent 
materials. Electron-hole pairs are generated by photon absorption (a), and the holes are quickly trapped at 
acceptor sites (b). Electrons may then recombine with these trapped holes, thereby giving rise to 
luminescent emission (d). However, before such a recombination can take place, the electron itself may 
spend some time trapped at a donor site (c). 

EXAMPLE 4.1 Luminescence lifetime due to traps 

If we take a value of 0.4 eV for $E_c - E_d$, then the probability of escape per second of 
a trapped electron at room temperature (where $kT$ = 0.025 eV) is of the order of 
$10^8 \exp(-0.4/0.025) \approx 10$ s$^{-1}$. 

Hence we would expect a luminescence lifetime of about 0.1 s. 
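
The estimate in Example 4.1 can be reproduced, and extended to other trap depths, with the sketch below. The function name `escape_rate` and its default arguments are illustrative; the prefactor $10^8$ s$^{-1}$ and $kT$ = 0.025 eV are the values quoted in the text, and the additional trap depths in the loop are arbitrary choices made only to show how strongly the lifetime depends on depth.

```python
import math


def escape_rate(trap_depth_ev, kt_ev=0.025, prefactor_per_s=1e8):
    """Escape probability per second, Q * exp(-(E_c - E_d) / kT)."""
    return prefactor_per_s * math.exp(-trap_depth_ev / kt_ev)


# Values from Example 4.1: 0.4 eV trap depth at room temperature.
rate = escape_rate(0.4)
lifetime = 1.0 / rate
print(f"escape rate = {rate:.1f} per second, lifetime = {lifetime:.2f} s")

# Illustrative trap depths (assumed values) showing the exponential dependence.
for depth_ev in (0.3, 0.4, 0.5, 0.6):
    print(f"depth {depth_ev:.1f} eV: lifetime = {1.0 / escape_rate(depth_ev):.3g} s")
```

Each additional 0.1 eV of trap depth multiplies the lifetime by $e^4$, roughly a factor of 55 at room temperature, which is why deep traps can give persistence of hours or days.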



We turn now to a more detailed discussion of photoluminescence, cathodoluminescence 
and electroluminescence. 



4.2 Photoluminescence 

As we have seen, in photoluminescence energy is transferred to the crystal by the absorption 
of a photon. In characteristic luminescent materials the activator ion itself absorbs the photon 
directly. It might be expected, therefore, that since the same energy levels are involved in 
absorption as in emission then the wavelengths for absorption and emission would be identical. 
In fact it is found that the peak emission wavelength is invariably shifted towards the red end 
of the spectrum compared with the peak of the absorption spectrum. This phenomenon is 
known as the Stokes shift, and it may be understood by taking account of the effect of the 
vibrations of the surrounding crystal lattice on the energy levels of the activator ions. The 
latter are often positively charged, typical examples being Cr$^{3+}$ and Mn$^{2+}$, although the exact 
charge state depends on the host. For the sake of argument, we assume here that each 
activator ion is positively charged and is surrounded by six equidistant negatively charged ions 
at a distance $R$ from the activator as shown in Fig. 4.2. We further assume that in the most 
important vibrational mode of the group, the activator ion remains at rest whilst the six 
surrounding negative ions all vibrate radially and in phase. Because of electrostatic interactions, 
the positions of the energy levels of the activator ion will depend on the value of $R$. A schematic 
diagram illustrating this variation for two energy levels is shown in Fig. 4.3. The most 
important feature of this diagram is that the minima of the two curves do not occur at the same values 
of $R$. This is perhaps not too surprising, since the equilibrium distribution of charge round 
the activator ion will be different when it is in each of the two states. 

FIG. 4.2 Assumed impurity site structure in a characteristic luminescent material. The impurity ion (I) is 
surrounded by six host ions (H) each at a distance $R$ from it. 

Consider now the absorption of a photon when the activator is in its ground state, where 
the most probable value for $R$ is $R_0$ (the position of minimum energy). Photon absorption is a 
very rapid process and takes place virtually instantaneously as far as the vibration of the 
surrounding ions is concerned. On the energy level diagram, therefore, the process may be 
represented by a vertical transition (i.e. with $R$ remaining constant). Immediately after the 
transition to the excited state the surrounding ions will not be at their equilibrium positions for this 
state and subsequently they will relax to their new equilibrium positions (at $R = R_1$ in Fig. 4.3). 
When a downward transition takes place, $R$ again remains constant and an inspection of Fig. 
4.3 shows that the emitted photon will then have less energy than that of the absorbed photon. 

FIG. 4.3 Schematic diagram of the variation in the energy of two electron energy levels of an impurity ion 
in a characteristic luminescent material as a function of the nearest neighbour ion separation $R$. If the ion is 
initially at the point A on the diagram (where $R = R_0$), then photon absorption can take place, and the 
energy of the ion will change to that of the point B. The surrounding ions then relax to a new equilibrium 
position ($R_1$), and the impurity ion moves from B to C, losing energy by phonon emission. The impurity ion 
may then make a transition to the point D and emit a photon. Once at D the surrounding ions relax back 
to $R_0$ and the impurity ion returns to the point A, again losing energy by phonon emission. 

FIG. 4.4 Absorption and emission spectra for thallium-activated potassium chloride (KCl:Tl) at room 
temperature. The emission peak occurs at a longer wavelength than that of the absorption curve. This is an 
example of the Stokes shift. 

We have of course been talking about the most probable transition. The ions surrounding 
the activator will always be in a state of oscillation, and hence at the instant of 
absorption the value of $R$ may well differ from $R_0$. The same will be true for the excited state. Thus 
instead of a single absorption (or emission) wavelength a band of absorption (or emission) 
wavelengths is seen. Further, since the amplitude of the oscillation will increase with 
increasing temperature, we would expect the width of the absorption and emission bands also to 
increase with increasing temperature. Figure 4.4 shows a curve of absorption and emission 
in KCl:Tl and illustrates the Stokes shift seen in this material at room temperature. 

The Stokes shift finds commercial application in fluorescent lamps, in which an electri- 
cal discharge is passed through a mixture of argon and mercury vapour. The emitted radi- 
ation has a bluish colour and an appreciable amount of the radiant energy is in the ultraviolet. 
If the walls of the discharge tube are coated with a suitable luminescing material, the ultra- 
violet radiation may be converted to useful visible radiation, thereby increasing the luminous 
efficiency of the lamp. 



4.3 Cathodoluminescence 

In cathodoluminescence, although the emission processes are the same as those outlined 
above for photoluminescence, the excitation mechanisms are somewhat more involved. When 
a beam of energetic electrons (say with energy greater than 1 keV) strikes a solid, a 
fraction (about 10%) is backscattered. The remaining electrons penetrate into the solid where 
they rapidly lose energy, mainly by causing bound electrons to be ejected from their parent 
ions. These secondary electrons in turn may generate further secondary electrons, provided 
they have sufficient energy. The final stages in the energy loss process consist of the 
excitation of electrons from states at the top of the valence band (with energy $E_v$) to those at the 
bottom of the conduction band (with energy $E_c$). Energy conservation considerations alone 
would dictate that the exciting electrons must at this stage have an energy of at least 
$E_c - E_v = E_g$ above $E_c$ if they are to create electron-hole pairs and still remain in the 
conduction band. In fact the additional constraint of momentum conservation requires a 
somewhat higher minimum energy of approximately $E_c + 3E_g/2$ (ref. 4.3). When electrons have 
energies between this value and $E_c$, they can only lose energy by exciting lattice vibrations 
(phonons). In addition, of course, even when electrons have higher energies than $E_c + 3E_g/2$, 
energy may still be wasted in phonon generation. It has been found empirically for a range 
of semiconductor materials that the total number of electron-hole pairs generated may be 
written as $E_B/(\beta E_g)$, where $E_B$ is the total electron beam energy and $\beta \approx 3$. This inefficiency 
in the generation of electron-hole pairs is a major factor in causing cathodoluminescence 
to be a considerably less efficient process than photoluminescence. 
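
The empirical pair-generation rule above can be illustrated numerically. In the sketch below the function name is hypothetical, and the band gap of 3.6 eV is an assumed, approximate value for ZnS that is not given in the text; only the form $E_B/(\beta E_g)$ with $\beta \approx 3$ comes from the discussion above.

```python
def pairs_generated(beam_energy_ev, band_gap_ev, beta=3.0):
    """Empirical number of electron-hole pairs per beam electron, E_B / (beta * E_g)."""
    return beam_energy_ev / (beta * band_gap_ev)


BAND_GAP_EV = 3.6        # assumed approximate band gap for ZnS
BEAM_ENERGY_EV = 10e3    # a 10 keV beam electron

n_pairs = pairs_generated(BEAM_ENERGY_EV, BAND_GAP_EV)

# Fraction of the beam energy stored in the pairs, if each pair carries E_g.
stored_fraction = n_pairs * BAND_GAP_EV / BEAM_ENERGY_EV

print(f"pairs per electron = {n_pairs:.0f}")
print(f"fraction of beam energy stored in pairs = {stored_fraction:.2f}")
```

Whatever band gap is assumed, the stored fraction reduces to $1/\beta$, so roughly two-thirds of the beam energy is lost before any luminescent recombination takes place.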

In non-characteristic materials, electron-hole recombination and luminescent emission 
then take place as for photoluminescence. In characteristic materials, on the other hand, it 
is thought that the next step is the formation of excitons (bound electron-hole pairs, see 
section 2.4.3). These migrate through the lattice and may subsequently transfer their recom- 
bination energy to the activator ions. 

As the primary electrons rapidly lose energy they penetrate only a little way into the solid 
they are exciting. It has been found experimentally that the penetration depth, or range, $R_e$ 
of an electron beam of energy $E_B$ is given by (ref. 4.4) 

$R_e = K E_B^{\,b}$ (4.2) 

where the parameters $K$ and $b$ depend on the material. For ZnS, for example, the range is 
in micrometres when $K = 1.2 \times 10^{-4}$ and $b = 1.75$. Thus a 10 keV electron beam has a range 
in ZnS of $1.2 \times 10^{-4} \times 10^{1.75}$ or 0.7 µm. 

It is often found that cathodoluminescent efficiency increases with increasing beam 
voltage. This may be attributed mainly to the fact that at low beam voltages most of the 
electron-hole pairs are generated close to the surface of the luminescent material, where there 
is often a relatively high concentration of non-luminescent recombination centres. (Another 
instance of a similar deleterious surface effect is the reduction in efficiency of 
photoconductive detectors with decreasing wavelength, as discussed in section 7.2.7.) 



4.4 Cathode ray tube 

The relative ease with which a beam of electrons can be directed and focused led to the early 
development and continued use of the cathode ray tube (CRT) as an important tool for the 
analysis of rapidly varying electrical signals as well as providing a versatile optical display 
device. Only a very brief discussion of the CRT can be attempted here and the reader is 
referred to ref. 4.5 for more details. Figure 4.5 shows the basic construction. Electrons are 
generated by thermionic emission (see section 2.6) by heating a specially impregnated 
cathode surface (usually based on oxides of barium and strontium) and then focused onto 
the viewing screen by a series of metal electrodes held at various potentials. A grid for the 
control of the electron flow is usually also included. The whole assembly is known as an 
'electron gun'. The electron beam is scanned across the viewing screen in a series of lines; 
when one line scan is completed the beam is rapidly switched to the start of the line below. 
Beam deflection is controlled by electrostatic or electromagnetic fields acting at right angles 
to the beam direction. Electrostatic deflection enables the highest beam deflection rates to 
be achieved, whilst electromagnetic deflection enables higher beam accelerating potentials 
to be employed, which results in a smaller spot size and higher screen brightness. When the 
beam strikes the viewing screen, radiation is generated by cathodoluminescence. The screen 
consists of a thin layer of small (dimensions ~ 5 µm) phosphor granules, with a layer of 
aluminium (≈ 0.1 µm thick) evaporated onto the gun side (Fig. 4.6). This layer serves two 
purposes: first it prevents charge build-up on the phosphor granules (which generally have low 
conductivities) and secondly it helps to reflect light emitted in a direction away from the 
observer back towards him or her. 

The thicknesses of both the aluminium and phosphor layers are fairly critical. If the alu- 
minium is too thick, an appreciable fraction of the electron beam energy will be absorbed 
within it, whilst if it is too thin its reflectivity will be poor. If the phosphor layer is too thick, 
scattering and absorption reduce the light output, whilst too thin a layer, on the other hand, 
can result in incomplete coverage of the screen area. 

For normal display operations (e.g. television) the beam is scanned line by line over the 
viewing area. In video applications the display consists of some 625 lines in Europe and 525 
in North America. To avoid an image that 'flickers' the picture must be renewed at a rate 
greater than about 45 Hz. However, it is possible to avoid having to renew the entire picture 
at this rate by using a raster scan that splits the picture up into two interlaced halves. Thus 
if a complete picture scan takes $t_s$ seconds then we may arrange that during the first $t_s/2$ 
seconds lines 1, 3, 5, 7, etc., are scanned, whilst during the second $t_s/2$ seconds lines 2, 4, 
6, 8, etc., are scanned. Because the two images are effectively superimposed, the eye treats 
the picture repetition rate as if it were $2/t_s$ Hz rather than $1/t_s$ Hz. This reduction in the rate 
at which picture information is required before flicker becomes troublesome is very useful 
because it halves the transmission frequency bandwidth that would otherwise be required. 
In Europe the entire picture is scanned in 1/25 s, whilst in North America this time is 1/30 s. 
Varying light irradiances are obtained by varying the beam current. Ideally, the phosphor 
used should have a luminescent decay time shorter than the picture cycle time, otherwise 
streaking effects due to image persistence are obtained. CRT displays can be made 
sufficiently bright for them to be visible under nearly all ambient lighting conditions. The 
brightness limits are usually reached when the phosphor screen rapidly deteriorates under high 
beam currents. 

FIG. 4.5 Schematic diagram of a CRT. An electron beam originating at an electron gun passes through an 
electrostatic deflection system (which uses two sets of plates at right angles, one for the 'x' deflection and 
the other for the 'y' deflection) and then falls onto a phosphor screen. Details of the electron gun and its 
associated focusing system are not shown. 

FIG. 4.6 Cross-section of a CRT screen. A layer of phosphor is sandwiched between a glass faceplate and 
an evaporated aluminium layer. High energy electrons penetrate the aluminium and excite 
cathodoluminescence in the phosphor particles. The aluminium layer reduces charge build-up and helps to 
reflect light back out through the faceplate. 
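
The interlaced raster scan described above can be made concrete with a short sketch. The function names are illustrative; the frame times (1/25 s and 1/30 s) and line counts (625 and 525) are the figures quoted in the text.

```python
def interlaced_rates(frame_time_s):
    """Return (full-picture rate, apparent repetition rate) in Hz for a 2:1 interlace."""
    frame_rate = 1.0 / frame_time_s   # 1/t_s: rate at which the whole picture is renewed
    field_rate = 2.0 / frame_time_s   # 2/t_s: rate perceived by the eye
    return frame_rate, field_rate


def interlaced_fields(n_lines):
    """Split line numbers 1..n_lines into the odd field and the even field."""
    odd_field = list(range(1, n_lines + 1, 2))
    even_field = list(range(2, n_lines + 1, 2))
    return odd_field, even_field


for region, frame_time, lines in (("Europe", 1 / 25, 625), ("North America", 1 / 30, 525)):
    frame_rate, field_rate = interlaced_rates(frame_time)
    odd, even = interlaced_fields(lines)
    print(f"{region}: picture rate {frame_rate:.0f} Hz, apparent rate {field_rate:.0f} Hz, "
          f"{len(odd)} odd lines + {len(even)} even lines")
```

In both cases the apparent repetition rate exceeds the 45 Hz flicker threshold even though the full picture is renewed at only half that rate.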

Colour displays for home video viewing are obtained using the 'shadowmask' principle, 
in which three electron guns are used that are slightly inclined to each other so that their 
beams coincide at the plane of the shadowmask. The latter is a metal screen, with holes in 
it, placed just in front of the phosphor screen. Having passed through one of the holes in the 
shadowmask the three beams diverge, and on striking the phosphor screen are again 
physically distinct. The phosphor screen consists of groups of three phosphor dots, placed so that 
when the three beams pass through a hole in the shadowmask they each hit a different dot. 
Figure 4.7 illustrates the basic geometry. Each of the three phosphor dots emits one of the 
primary colours (i.e. blue, green and red), so that any desired colour can be generated by 
varying the relative excitation intensities. Some of the more commonly used phosphors are 
zinc sulfide doped with silver: ZnS:Ag (blue); zinc cadmium sulfide doped with copper: 
Zn$_x$Cd$_{1-x}$S:Cu (green); and yttrium oxysulfide doped with europium and terbium: 
Y$_2$O$_2$S:Eu,Tb (red). The first two are non-characteristic phosphor materials whilst the last 
is a characteristic material. There is obviously some loss in resolution capability over a 
monochrome display since the coloured dots are physically displaced on the screen. Furthermore, 
the alignment of the shadowmask with the guns and the phosphor screen is critical, and can 
be spoilt by fairly harsh environmental conditions such as stray magnetic fields. Several 
modifications have been made to the basic shadowmask principle to try to rectify some of these 
disadvantages; the interested reader is referred to ref. 4.5 for further information. 

FIG. 4.7 Use of the shadowmask for obtaining colour displays. Three separate guns are used; these are 
inclined slightly to each other so that their beams will all pass through a single hole in the shadowmask. 
After passing through the hole, the beams diverge so that each falls on one of the three circular areas 
composed of phosphors, each of which emits one of the three primary colours (blue, green and red). 

To render the screen more easily visible in the presence of ambient radiation the face- 
plate is usually tinted a shade of grey, with a transmission of about 50%. This increases the 
contrast between the light emitted from the phosphor particles (which traverses the faceplate 
once) and the ambient light reflected from the back of the phosphor (which traverses the 
faceplate twice). A recent proposal has been to put an appropriately coloured 'microfilter' 
between each area of phosphor and the faceplate. The filters allow most of the radiation 
emitted by the area of phosphor to pass through, whilst absorbing much of the incident 'white' 
ambient light. The faceplate can now be made of clear glass, so that the screens are almost 
50% brighter (or alternatively achieve the same brightness with the use of less power). 
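
The contrast argument for the tinted faceplate can be sketched as follows. This is a simplified model under stated assumptions: the phosphor is taken to reflect ambient light without loss, and reflections at the glass surfaces are ignored; the function name is illustrative.

```python
def faceplate_contrast_gain(transmission):
    """Factor by which emitted-to-ambient contrast improves for a tinted faceplate.

    Emitted light crosses the faceplate once (factor T); reflected ambient light
    crosses it twice (factor T**2), so the ratio improves by T / T**2 = 1 / T.
    """
    emitted_factor = transmission
    ambient_factor = transmission ** 2
    return emitted_factor / ambient_factor


gain = faceplate_contrast_gain(0.5)
print(f"contrast gain for 50% transmission: {gain:.1f}")
```

For the 50% transmission quoted above the contrast improves by a factor of two, at the cost of halving the emitted light, which is the loss that the microfilter approach aims to recover.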
4.5 Electroluminescence 



We are concerned here with what might be termed 'classical electroluminescence' as opposed 
to 'injection electroluminescence', which uses fabricated p-n junctions and which will be 
dealt with in section 4.6. Four main types of device may be distinguished, depending on the 
type of drive (a.c. or d.c.) and the character of the active layer (powder or thin film). The 
first electroluminescent device to be extensively studied was the a.c. powder device, 
proposed in 1936. In this a phosphor powder (usually ZnS:Cu) is suspended in a transparent 
insulating binding medium of high dielectric constant and is sandwiched between two 
electrodes (one of which is transparent) as shown in Fig. 4.8(a). Usually there is no complete 
conducting path between the electrodes, so that d.c. excitation is not possible. When an 
alternating voltage, $V_0\cos(2\pi f t)$, is applied across the cell, however, light is emitted in the form 
of short bursts which last about $10^{-3}$ s and occur once every half cycle. It is found that the 
integrated light output power $P$ can be written in the form (ref. 4.6) 

$P = P_0(f)\exp[-(V_1/V_0)^{1/2}]$ (4.3) 

where $V_1$ is a constant and $P_0(f)$ is a function of frequency. The strongest emission from 
within the phosphor grain is found to take place from the side temporarily facing the cathode. 
Several possible emission mechanisms have been proposed (see ref. 4.6 for further details); 
it is generally agreed, however, that there will be a high electric field within the phosphor 
particle. It is then possible that this field is sufficiently strong to enable electrons from 
occupied acceptor levels to 'tunnel' to states of the same energy in the conduction band, as 
illustrated in Fig. 4.9(a). (Quantum mechanical tunnelling was briefly mentioned in section 
2.8.5.) Other electrons in the conduction band are then able to fall into these vacated levels 
and emit radiation (Fig. 4.9b). 

FIG. 4.8 (a) Construction of an a.c. electroluminescent device. Phosphor particles are suspended within a 
transparent insulating medium and sandwiched between two electrodes, one of which is transparent. 
When an alternating voltage is applied to the electrodes the phosphor particles emit light. (b) The 
construction of a d.c. electroluminescent device. The phosphor particles have a coating of Cu$_x$S. This 
coating is removed from the anode side of the particles in contact with the anode by the application of an 
initial high current pulse. Under normal conditions, light is emitted only from the Cu$_x$S-depleted particles. 

Another possibility is that an electron moving in the electric field may acquire sufficient 
energy to enable it to excite an electron from the valence band to the conduction band. The 
resulting hole quickly becomes trapped at an impurity acceptor site, thereby effectively emp- 
tying it of an electron. An electron in the conduction band can then make a radiative tran- 
sition by falling into the empty acceptor level. The sequence of events is illustrated in Fig. 
4.10. In phosphors containing manganese, there is evidence that the Mn$^{2+}$ ions themselves
may be directly excited by the high energy electrons, radiation being emitted when the ion
subsequently undergoes de-excitation. A.C. powder devices usually require several hundred
volts (r.m.s.) to drive them. They exhibit luminances of about 40 nits (for a definition of the
nit, see section 4.8), have power efficiencies (i.e. the ratio of optical power out to electrical
power in) of about 1% and lifetimes of about 1000 hours. By using different phosphor
powders, red, green, yellow and blue displays are possible. 

A more recent development is the d.c. powder display (Fig. 4.8b). These have a structure
basically similar to that of the a.c. device. However, the phosphor particles (ZnS:Cu,Mn)
are coated with a conducting layer of Cu$_x$S. Provided the phosphor particles are not too widely
dispersed within the binder there will be a conducting path from the anode to the cathode.
Before normal operation, the cell must be 'formed' by applying a high voltage across it for
a short time. This causes copper ions to migrate away from the phosphor surfaces next to
the anode. A thin high resistance layer of ZnS is then created next to the anode, across which
most of the applied voltage appears, and from which light emission takes place during
subsequent operation at lower voltages. Luminances of about 300 nits are possible at voltages
of around 100 V d.c., although the power conversion efficiencies are low at approximately
0.1%. The luminance versus drive voltage characteristics for both a.c. and d.c. powder
electroluminescent devices are shown in Fig. 4.11.

FIG. 4.9 Possible mechanism for electroluminescence emission involving quantum mechanical
tunnelling. In (a) an electron in an acceptor state 'tunnels' through the forbidden gap region
into states of the same energy. It is only able to do this if there is a considerable electric
field present, thus causing the energy bands to be tilted. An electron in the conduction band
may now fall into the vacated level resulting in radiative emission (b).

FIG. 4.10 Possible mechanism for electroluminescence emission involving an avalanche process.
In (a) an electron moving in the high electric fields present may acquire sufficient energy to
excite an electron from the valence band into the conduction band. The hole left behind then
moves up into an acceptor state, effectively emptying it of an electron (b). Finally, an electron
in the conduction band may then make a radiative transfer into the empty acceptor level (c).

FIG. 4.11 Typical luminances obtained for a.c. and d.c. electroluminescent powder devices as a
function of the applied voltage (r.m.s. volts in the case of the a.c. device).

In addition, a.c. and d.c. devices have been made where the active layer is a vacuum-deposited
thin film of phosphor material (usually based on ZnS), but both types tend to have
rather poor lifetimes. Although a considerable amount of research effort has been put into
the development of electroluminescent displays, they have yet to make any significant
commercial impact. They do offer the possibility, however, of large area displays with a high
surface area to volume ratio, and their output-voltage characteristics, particularly those of
the d.c. powder devices, are suitable for matrix addressing, as discussed in section 4.10.



4.6 Injection luminescence and the light-emitting diode

The basic structure giving rise to injection luminescence is that of a p-n junction diode oper- 
ated under forward bias which was discussed in Chapter 2. Under forward bias, majority 
carriers from both sides of the junction cross the depletion layer and enter the material at 
the other side, where they are then the minority type of carrier and cause the local minority 
carrier population to be larger than normal. This situation is described as minority carrier 
injection. The excess minority carriers diffuse away from the junction recombining with 
majority carriers as they do so. Using eq. (2.42), we may write the excess electron
concentration $\Delta n(x)$ in the p material as a function of distance $x$ from the edge of the
depletion region as

$$\Delta n(x) = \Delta n(0)\exp(-x/L_e)$$

The process is illustrated in Fig. 4.12. Ideally, in a light-emitting diode (LED) every 
injected electron takes part in a radiative recombination process and hence gives rise to an 
emitted photon. In practice this is not so, and the efficiency of the device may be described 
in terms of the quantum efficiency, which is defined as the rate of emission of photons divided 
by the rate of supply of electrons. In reverse bias, no carrier injection takes place and
consequently no light is emitted. The current-voltage $(i, V)$ relationship for a diode can
usually be written (see eq. 2.51 and section 2.8.5) as

$$i = i_0\left[\exp\left(\frac{eV}{\beta kT}\right) - 1\right] \tag{4.4}$$

where $i_0$ is a constant (the reverse saturation current).

FIG. 4.12 Injection of minority carriers and subsequent radiative recombination with the majority
carriers in a forward-biased p-n junction.
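
The two relations of this section can be explored numerically. The following is a minimal sketch, assuming the usual diode form $i = i_0[\exp(eV/\beta kT) - 1]$ and the exponential decay of injected carriers given above; the saturation current, the value of $\beta$ and the diffusion length used in the demonstration are illustrative assumptions, not values taken from the text.

```python
import math

K_B = 1.380649e-23      # Boltzmann constant, J/K
Q_E = 1.602176634e-19   # electronic charge, C


def diode_current(v, i0, beta=1.0, temp=300.0):
    """Diode current i = i0 * (exp(eV / (beta k T)) - 1), in the units of i0."""
    return i0 * math.expm1(Q_E * v / (beta * K_B * temp))


def excess_electrons(x, dn0, diffusion_length):
    """Excess electron concentration a distance x beyond the depletion edge."""
    return dn0 * math.exp(-x / diffusion_length)


if __name__ == "__main__":
    i0 = 1e-12  # assumed reverse saturation current, A
    for v in (-1.0, 0.0, 0.5, 1.0):
        print(f"V = {v:+.1f} V  ->  i = {diode_current(v, i0, beta=2.0):.3e} A")
    # One diffusion length from the junction the excess population is down by 1/e.
    print(excess_electrons(1e-6, 1.0, 1e-6))
```

In reverse bias the current saturates at $-i_0$, which is consistent with the statement that no carrier injection, and hence no light emission, takes place.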

The number of radiative recombinations that take place is usually proportional to the carrier
injection rate and hence to the total current flowing. If the transitions take place directly
between states at the bottom of the conduction band and at the top of the valence band, then
the emission wavelength $\lambda_g$ is given by (see eq. 4.1)

$$hc/\lambda_g = E_c - E_v = E_g$$

Therefore

$$\lambda_g = hc/E_g \tag{4.5}$$


For example, GaAs has an energy bandgap of 1.43 eV, which corresponds to a value for $\lambda_g$
of 0.86 μm. In fact, because of thermal excitation, electrons in the conduction band have a
most probable energy which is $kT/2$ above the bottom of the conduction band (see Problem
2.20). Band-to-band transitions therefore result in a slightly shorter emission wavelength than
given by eq. (4.5), and self-absorption can further distort the situation. However, as we shall
see later, most transitions involve energy levels within the energy gap, and for these eq. (4.5)
represents the shorter wavelength limit. We now consider the transition processes in more
detail.
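
As a quick numerical check of the bandgap-wavelength relation $\lambda_g = hc/E_g$, the sketch below converts between the two quantities. The function names are illustrative, and the physical constants are the standard SI values.

```python
H = 6.62607015e-34      # Planck constant, J s
C = 2.99792458e8        # speed of light in vacuum, m/s
Q_E = 1.602176634e-19   # joules per electronvolt


def bandgap_wavelength_um(e_gap_ev):
    """Wavelength (micrometres) of a photon whose energy equals the bandgap."""
    return H * C / (e_gap_ev * Q_E) * 1e6


def bandgap_for_wavelength_ev(wavelength_um):
    """Bandgap (eV) whose band-to-band emission has the given wavelength."""
    return H * C / (wavelength_um * 1e-6 * Q_E)


if __name__ == "__main__":
    print(f"GaAs, 1.43 eV: {bandgap_wavelength_um(1.43):.3f} um")
    print(f"0.62 um (red): {bandgap_for_wavelength_ev(0.62):.2f} eV")
```

A convenient rule of thumb that follows from the same constants is $\lambda_g\,(\mu\mathrm{m}) \approx 1.24/E_g\,(\mathrm{eV})$.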



4.6.1 Radiative recombination processes 

Radiative recombination in semiconductors occurs predominantly via three different pro- 
cesses, namely (a) interband transitions, (b) recombination via impurity centres and (c) 
exciton recombination. 



4.6.1.1 Interband transitions 

This recombination process is illustrated schematically on an energy level diagram for both
direct and indirect bandgap materials in Figs 4.13(a) and (b) respectively. (E-k diagrams
for silicon and gallium arsenide are shown in Fig. 2.6.) It is important to realize that the
transition must conserve the total wavevector of the system. The photon wavevector is given
by $2\pi/\lambda$, whilst the electron wavevectors involved range between approximately
$-\pi/a$ and $+\pi/a$, where $a$ is the crystal lattice spacing (i.e. the $k$ values at the first
Brillouin zone boundaries; see Fig. 2.5b). For visible radiation,
$\lambda \approx 0.5 \times 10^{-6}$ m, whereas crystal lattice spacings are approximately
$10^{-10}$ m: hence $2\pi/\lambda \ll \pi/a$. The photon wavevector is thus much smaller
than the magnitude of the maximum possible electron wavevectors. Consequently if the only
particles involved are an electron and a photon, then the electron must make a transition
between states having virtually the same wavevector. On an E-k diagram, therefore, only
vertical transitions are allowed. It is possible to have non-vertical transitions (as illustrated
in Fig. 4.13b) but to conserve the wavevector, a phonon must be either created or destroyed
at the same time. The equation for the wavelength of the emitted photon is then given by

$$hc/\lambda = E_g \pm E_p \tag{4.6}$$

where $E_p$ is the phonon energy. The + and - signs correspond to phonon annihilation and
creation respectively. Phonon energies are of the order of 0.01 eV, and hence the photon
wavelength in fact differs little from $\lambda_g$. However, because now three particles are
involved instead of just two for the direct transition process, the transition is much less
probable. We know from the discussion in section 2.4 that the interband recombination rate $r$
may be written

$$r = Bnp \tag{4.7}$$

where $B$ is a constant. Calculated values for $B$ in various semiconductor materials are shown
in Table 4.1 (see ref. 4.7), from which we see that the values of $B$ for indirect bandgap
materials are some $10^6$ times smaller than for direct bandgap materials. We conclude that
band-to-band radiative transitions in indirect bandgap semiconductors are relatively rare, and
unless some other radiative transition mechanisms are possible these materials will not be
suitable for LEDs.

One disadvantage of radiation derived from direct bandgap recombination is that the
probability of the emitted radiation being reabsorbed in band-to-band transitions can be
high when the radiation has to traverse an appreciable thickness of the semiconductor
material.

FIG. 4.13 Interband transitions for (a) a direct bandgap semiconductor and (b) an indirect
bandgap semiconductor. In the former case, there is no change in the electron $k$ value, whilst
in the latter case there is.
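
The orders of magnitude quoted in this subsection are easy to reproduce. The sketch below compares the photon wavevector $2\pi/\lambda$ with the zone-boundary wavevector $\pi/a$ and evaluates the small wavelength shift produced by a phonon of energy $E_p$ in $hc/\lambda = E_g \pm E_p$. The function names are illustrative, and the GaP bandgap and 0.01 eV phonon energy in the demonstration are used only as representative inputs.

```python
import math

H = 6.62607015e-34      # Planck constant, J s
C = 2.99792458e8        # speed of light in vacuum, m/s
Q_E = 1.602176634e-19   # joules per electronvolt


def photon_wavevector(wavelength_m):
    """Magnitude of the photon wavevector, 2*pi/lambda, in rad/m."""
    return 2.0 * math.pi / wavelength_m


def zone_boundary_wavevector(lattice_spacing_m):
    """Electron wavevector at the first Brillouin zone boundary, pi/a."""
    return math.pi / lattice_spacing_m


def phonon_assisted_wavelength_nm(e_gap_ev, e_phonon_ev, phonon_absorbed):
    """Emission wavelength from hc/lambda = E_g + E_p (absorbed) or E_g - E_p (created)."""
    sign = 1.0 if phonon_absorbed else -1.0
    photon_energy_j = (e_gap_ev + sign * e_phonon_ev) * Q_E
    return H * C / photon_energy_j * 1e9


if __name__ == "__main__":
    ratio = photon_wavevector(0.5e-6) / zone_boundary_wavevector(1e-10)
    print(f"photon k / zone-boundary k = {ratio:.1e}")
    for absorbed in (True, False):
        lam = phonon_assisted_wavelength_nm(2.26, 0.01, absorbed)
        print(f"phonon absorbed={absorbed}: {lam:.1f} nm")
```

The ratio of the two wavevectors is of order $10^{-4}$, which is why a photon alone cannot supply the change in $k$ needed for a non-vertical transition.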



4.6.1.2 Impurity centre recombination

Three types of recombination involving impurity energy levels are shown in Fig. 4.14. Thus
we may have (a) conduction band-acceptor level transitions, (b) donor level-valence band
transitions and (c) if in addition a pair of donor and acceptor states are close together, then
donor-acceptor transitions are possible. When the electron is in either type of impurity state,
then it will be fairly strongly localized. This spatial localization implies that the electron can
have a range of momentum values, since uncertainties in both position ($\Delta x$) and
momentum ($\Delta p$) are related via the Heisenberg uncertainty relation (eq. 2.3)

$$\Delta x\,\Delta p \ge \hbar/2$$

Because the magnitudes of the electron wavevector $k$ and the momentum $p$ are related by
$k = p/\hbar$ we may write

$$\Delta k \ge \frac{1}{2\Delta x} \tag{4.8}$$

We express the electron localization by putting $\Delta x = Na$, where $a$ is the lattice spacing
and $N$ a number which is not expected to be much larger than unity; whence
$\Delta k \ge 1/(2Na)$. In our discussion at the start of section 4.6.1.1, we noted that electrons
in the conduction and valence bands have a range of $k$ values extending approximately from
$-\pi/a$ to $+\pi/a$. Hence we see that there may be sufficient spread in the $k$ values of the
impurity states to allow a significant number of transitions between them and the band extrema
without calling on the assistance of phonons. Transitions via impurity states therefore provide
a possible mechanism whereby indirect bandgap semiconductors can increase their radiative
efficiency.

FIG. 4.14 Three types of recombination involving impurity energy levels. In (a) an electron
moves from the conduction band into an empty acceptor level. In (b) an electron in a donor level
recombines with a hole in the valence band. In (c) an electron in a donor level falls into an
empty acceptor level. This latter process requires that the donor and acceptor levels are
physically close together.

Typical values for $E_c - E_d$ and $E_a - E_v$ are of the order of 0.02 eV, so that the emission
wavelength will be slightly longer than that given by eq. (4.5). However, thermal excitation
within the bands themselves will tend to decrease the magnitude of this effect (see the
discussion following eq. 4.5). Since the radiation has a lower photon energy than that required
to excite an electron across the bandgap, it is not subject to reabsorption to the same extent
as that derived from band-to-band recombinations.
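
The localization argument of this subsection can be put into numbers. The sketch below evaluates the lower bound $\Delta k \ge 1/(2Na)$ and expresses it as a fraction of the zone-boundary wavevector $\pi/a$; the function names and the sample values of $N$ and $a$ are illustrative assumptions.

```python
import math


def wavevector_spread(n_cells, lattice_spacing_m):
    """Lower bound on the wavevector spread, 1/(2 N a), for localization over N cells."""
    return 1.0 / (2.0 * n_cells * lattice_spacing_m)


def fraction_of_zone(n_cells):
    """The same bound divided by pi/a; the lattice spacing cancels out."""
    return 1.0 / (2.0 * math.pi * n_cells)


if __name__ == "__main__":
    for n in (1, 2, 5):
        dk = wavevector_spread(n, 5e-10)
        print(f"N = {n}: dk >= {dk:.2e} rad/m, {100 * fraction_of_zone(n):.1f}% of pi/a")
```

Even this lower bound is an appreciable fraction of the zone-boundary wavevector when $N$ is close to unity, in contrast to the photon wavevector, which is some four orders of magnitude smaller than $\pi/a$.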



4.6.1.3 Exciton recombination 

Exciton states exist within the energy gaps of even pure semiconductor materials (they must
not be confused with impurity donor or acceptor states). We may visualize such states as being
akin to Bohr-like states in which an electron and a hole circle round their common centre of
gravity at relatively large distances (Fig. 2.12). The electron and hole are relatively weakly
bound and the exciton states are situated just below the bottom of the conduction band. In
Chapter 2 we showed (eq. 2.28a) that we may write the exciton binding energy $E_e$ as

$$E_e = 13.6\,\frac{m_r^*}{m}\left(\frac{1}{\varepsilon_r}\right)^2\ \mathrm{eV} \tag{4.9}$$

where $m_r^*$ is the reduced mass. For example, in GaAs we have $\varepsilon_r = 11.5$,
$m_e^* = 0.068m$ and $m_h^* = 0.47m$, whence $m_r^* = 0.06m$ and $E_e = 5.9$ meV. Observed
experimental values are in reasonable agreement with this simple model calculation; in GaAs the
exciton binding energy is found to be 4.8 meV. The exciton is capable of movement through the
lattice, although, since exciton energies can be affected by the presence of impurities, in some
circumstances an exciton may remain 'bound' in the vicinity of the impurity. If the impurities
are neutral donors or acceptors then the exciton binding energy is usually about one-tenth of
that of the centres to which they are bound. (The binding energy may be much larger than this at
isoelectronic traps, which are discussed in section 4.6.3.) Bound exciton states may be
sufficiently well localized so that electron-hole recombination can take place in indirect
bandgap semiconductors via these states without the need for phonon intervention to conserve the
wavevector.
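
The hydrogen-like estimate of the exciton binding energy is simple to evaluate. The sketch below assumes the form $E_e = 13.6\,(m_r^*/m)(1/\varepsilon_r)^2$ eV with the reduced mass $m_r^* = m_e^* m_h^*/(m_e^* + m_h^*)$; the function names are illustrative. With the GaAs parameters quoted in the text it gives a value of roughly 6 meV.

```python
RYDBERG_EV = 13.6  # hydrogen ground-state binding energy, eV


def reduced_mass(m_e_rel, m_h_rel):
    """Reduced effective mass in units of the free electron mass."""
    return m_e_rel * m_h_rel / (m_e_rel + m_h_rel)


def exciton_binding_energy_ev(m_e_rel, m_h_rel, eps_r):
    """Hydrogen-like exciton binding energy, scaled by reduced mass and 1/eps_r^2."""
    return RYDBERG_EV * reduced_mass(m_e_rel, m_h_rel) / eps_r**2


if __name__ == "__main__":
    m_r = reduced_mass(0.068, 0.47)
    e_b = exciton_binding_energy_ev(0.068, 0.47, 11.5)
    print(f"reduced mass = {m_r:.3f} m, binding energy = {1e3 * e_b:.1f} meV")
```

Because the result scales as $1/\varepsilon_r^2$, small differences in the assumed relative permittivity shift the estimate by a few tenths of a meV.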

4.6.1.4 Emission linewidths

At a finite temperature electrons in the conduction band of a semiconductor are distributed
amongst the energy levels with a probability of occupancy given by the Fermi distribution
function. At the end of section 2.5 it is shown that if we denote the number of electrons per
unit volume which have energies between $E$ and $E + dE$ by $n(E)\,dE$ then, with $E$ measured
from the bottom of the conduction band,

$$n(E) \propto E^{1/2}\exp\left(-\frac{E}{kT}\right)$$


The function $n(E)$ is sketched in Fig. 2.14(c), and it can be shown (Problem 2.20) that it has
a halfwidth of about $2kT$ (the actual value is close to $1.8kT$). A similar result can be
obtained for holes in the valence band. Thus assuming direct band-to-band transitions we would
expect a range of emission frequencies, $\Delta\nu$, which have a halfwidth of about $2kT/h$.
This may be converted to the equivalent wavelength spread $\Delta\lambda$ by use of the
relationship $\lambda\nu = c$, so that

$$\frac{d\nu}{d\lambda} = -\frac{c}{\lambda^2}$$

and hence

$$\Delta\lambda = \frac{2kT\lambda^2}{hc} \tag{4.10}$$

At a wavelength of 900 nm and a temperature of 300 K, eq. (4.10) predicts a halfwidth of
about 34 nm, which agrees reasonably well with what is observed in practice (see e.g. Fig. 4.15).
The result remains reasonably valid even when the transitions involve impurity states rather
than direct band-to-band transitions.
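
The thermal linewidth estimate can be evaluated directly. The sketch below assumes the relation $\Delta\lambda = 2kT\lambda^2/hc$ for the halfwidth; the function name is illustrative. For 900 nm and 300 K the expression evaluates to roughly 34 nm.

```python
H = 6.62607015e-34   # Planck constant, J s
C = 2.99792458e8     # speed of light in vacuum, m/s
K_B = 1.380649e-23   # Boltzmann constant, J/K


def thermal_linewidth_nm(wavelength_nm, temp_k, width_in_kt=2.0):
    """Wavelength halfwidth for an energy spread of width_in_kt * kT."""
    wavelength_m = wavelength_nm * 1e-9
    return width_in_kt * K_B * temp_k * wavelength_m**2 / (H * C) * 1e9


if __name__ == "__main__":
    print(f"2.0 kT: {thermal_linewidth_nm(900.0, 300.0):.1f} nm")
    print(f"1.8 kT: {thermal_linewidth_nm(900.0, 300.0, width_in_kt=1.8):.1f} nm")
```

Since the width scales as $\lambda^2 T$, longer-wavelength emitters and higher operating temperatures both give broader spectra.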



4.6.2 LED materials 

We may summarize the main requirements for a suitable LED material as follows: first, it 
must have an energy gap of appropriate width; secondly, both p- and n-types must exist, 
preferably with low resistivities; and finally, efficient radiative pathways must be present. 

FIG. 4.15 Spectral output of an Si-doped and a Zn-diffused GaAs LED.



Equation (4.5) indicates that, in order to obtain visible radiation, energy gaps greater than 
or equal to about 2 eV are required. Unfortunately, materials with such large gaps tend to 
have high resistivities even when doped. Furthermore, in most cases the wider the energy 
gap, the greater are the difficulties met in material preparation. This is often because the mate- 
rials have high melting temperatures and low structural stability. 



4.6.3 Commercial LED materials

GALLIUM ARSENIDE (GaAs)

This is a direct bandgap semiconductor with $E_g = 1.443$ eV ($\lambda_g = 860$ nm); suitable
p-n junctions may be made by diffusing zinc into crystals of n-type GaAs. The resulting
radiation arises from band-to-band transitions, however, and is consequently subject to heavy
reabsorption; this reduces the device efficiency and shifts the peak emission wavelength to
about 870 nm. More efficient diodes may be made by using silicon as a dopant, where,
depending on the growth conditions, either p or n material is obtained. Complex acceptor
levels are also formed about 0.1 eV above the valence band. Transitions between these and
the conduction band give rise to radiation with a peak emission wavelength of about 1000 nm
which is not subject to reabsorption. Typical emission spectra for both Zn-diffused and
Si-doped diodes are shown in Fig. 4.15.

GALLIUM PHOSPHIDE (GaP) 

This is an indirect bandgap semiconductor with $E_g = 2.26$ eV ($\lambda_g = 549$ nm) and hence
band-to-band transitions are rare. Group V elements such as N and Bi may be used as dopants to
assist radiative transitions. These replace the phosphorus atoms and form recombination
centres called isoelectronic traps. In the case of nitrogen the effective trap depth below the
conduction band is small (8 meV) and the subsequent radiation has a peak wavelength only
slightly greater than $\lambda_g$. Using increased levels of nitrogen doping, and also doping
with zinc and oxygen simultaneously, gives rise to deeper traps and consequently longer emission
wavelengths.

GALLIUM ARSENIDE PHOSPHIDE (GaAs$_{1-x}$P$_x$)

The energy gap of this ternary alloy depends on the value of $x$ and furthermore it changes
from being a direct bandgap when $x < 0.45$ to an indirect bandgap when $x > 0.45$. Using diodes
with $x = 0.4$ results in red emission (Fig. 4.16). The indirect bandgap material can also be
used in conjunction with the same radiative assisting dopants as used in GaP.

GALLIUM ALUMINIUM ARSENIDE (Ga$_x$Al$_{1-x}$As)

Highly efficient red- and near-infrared-emitting LEDs can be made from this material. If a
heterojunction is formed between n-type Ga$_{0.3}$Al$_{0.7}$As and p-type Ga$_{0.6}$Al$_{0.4}$As,
electrons injected from the n surface layer into the p material recombine radiatively via
acceptor levels and result in radiation of 650 nm wavelength. This can pass through the surface
layer with little attenuation because of the relatively large bandgap of the latter.

III-V NITRIDES (e.g. GaN and AlN)

These materials have energy gaps that correspond to emission wavelengths from green all 
the way into the ultraviolet. A number of problems frustrated the early development of these 
materials; for example, until recently there was a lack of suitable substrate materials with 
matching lattice constants and thermal expansion coefficients. It also proved difficult to dope 
the materials p-type. It was not until the early 1980s that these problems were overcome, 
when it was found that the deposition of buffer layers allowed growth on readily available 
substrates such as sapphire and silicon carbide. In addition new growth and irradiation tech- 
niques have enabled p-type layers to be fabricated. 

INDIUM GALLIUM NITRIDE (InGaN)

High brightness LEDs are now commercially available based on InGaN, although the structures
(e.g. 'double heterostructures', see sections 2.8.6 and 5.10.2.3) are somewhat more
complicated than those discussed so far. By increasing the indium content the emission
wavelength increases, and radiation in the green can also be obtained with reasonably high
efficiency.

II-VI SEMICONDUCTORS (e.g. zinc selenide and related compounds)

These materials have also been used successfully to make blue- and green-emitting diodes.
For example, green-emitting diodes have been made using ZnTeSe as an active region grown
on a ZnSe substrate. By replacing the ZnTeSe layer with one of ZnCdSe emission has been
obtained in the blue. However, these materials are much softer than the III-V nitrides and
degrade more rapidly at elevated temperatures and consequently have shorter working
lifetimes.

TABLE 4.2 Characteristics of the most commonly used LED materials

Material                              Dopant   Peak emission (nm)   Colour     External quantum efficiency (%)
GaAs                                  Si       910 → 1020           Infrared   10
Ga$_x$Al$_{1-x}$As (1 < x < 0.7)      Si       879 → 890            Infrared   15
GaP                                   Zn, O    700                  Red        4
GaAs$_{0.6}$P$_{0.4}$                 -        650                  Red        0.2
Ga$_{0.6}$Al$_{0.4}$As                Zn       650                  Red        15†
GaAs$_{0.35}$P$_{0.65}$               N        632                  Orange     0.2
GaP                                   N, N     590                  Yellow     0.1
AlInGaP                               -        570                  Yellow     1†
GaP                                   N        555                  Green      0.1
ZnCdSe                                -        489                  Blue       1.3†
SiC                                   -        470                  Blue       0.03
In$_{0.06}$Ga$_{0.94}$N               Zn       450                  Blue       3.8†

† Indicates a double heterostructure diode.

SILICON CARBIDE (SiC)

Silicon carbide has been a promising material for many years, but its high melting point
and consequent growth difficulties prevented its early use in LEDs. It can be doped as both
n- and p-type and commercial blue-emitting diodes have been available for some years.
Doping with B, Al, Sc and Be gives rise to yellow, blue, green and red emission
respectively. However, efficiencies have always been very low and with the recent development
of highly efficient InGaN emitters the future prospects for this material do not seem too
good.

Table 4.2 summarizes the characteristics of the most commonly used LED materials.



4.6.4 LED construction

A typical LED construction is shown in Fig. 4.16. It is obviously advantageous if most of
the radiative recombinations take place from the side of the junction nearest the surface, since
then the chances of reabsorption are lessened. We may ensure this by arranging that most
of the current flowing across the diode is carried by those carriers that are injected into the
surface layer. We assume that, as in Fig. 4.16, the surface layer is p-type. The fraction of
the total diode current that is carried by electrons being injected into the p side of the
junction ($\eta_e$) is then given by

$$\eta_e = \frac{D_e n_p/L_e}{D_e n_p/L_e + D_h p_n/L_h}$$

or

$$\eta_e = \left(1 + \frac{D_h L_e p_n}{D_e L_h n_p}\right)^{-1} \tag{4.11}$$

This equation is readily derived from the results of section 2.8.2. By using the Einstein
relation (eq. 2.40), $D_{e,h} = (kT/e)\mu_{e,h}$, and the relation
$n_p p_p = p_n n_n = n_i^2$ (eq. 2.36), eq. (4.11) becomes

$$\eta_e = \left(1 + \frac{\mu_h L_e p_p}{\mu_e L_h n_n}\right)^{-1} \tag{4.12}$$

In III-V compounds $\mu_e > \mu_h$, and so, assuming that $L_e \approx L_h$, we see that there
is a natural tendency for $\eta_e$ to be close to unity. We may reinforce this tendency by
arranging that $n_n \gg p_p$ (i.e. by making an n$^+$-p diode).

FIG. 4.16 Cross-section of a standard red-emitting LED chip based on GaAsP. A layer of n-type
GaAs$_{0.6}$P$_{0.4}$ (using tellurium as a dopant) is deposited by vapour phase epitaxy on a
GaAs substrate. A p-n junction is then formed by diffusing in Zn through a surface mask. The
small aluminium contact on the upper surface allows as much of the radiation to escape as
possible; any radiation flowing downwards is almost completely absorbed by the GaAs.

FIG. 4.17 Phenomenon of total internal reflection. When a beam of light is incident at an angle
$\theta_1$ onto an interface between two media of refractive indices $n_1$ and $n_2$
($n_2 < n_1$) then the refracted beam (beam 1) makes an angle $\theta_2$ with the normal to the
surface where $n_1\sin\theta_1 = n_2\sin\theta_2$ (Snell's law). At the critical angle
($\theta_1 = \theta_c$), $\theta_2 = 90°$ (beam 2) and the refracted beam emerges along the
interface. At angles of incidence greater than $\theta_c$ (beam 3), the beam is totally reflected
back into the first medium.
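
The dependence of the electron injection fraction on mobility and doping can be illustrated numerically. The sketch below assumes the mobility form of the result, $\eta_e = [1 + \mu_h L_e p_p/(\mu_e L_h n_n)]^{-1}$, which follows from the Einstein relation and $n_p p_p = p_n n_n = n_i^2$; the function name, the mobilities and the doping levels are illustrative assumptions rather than values from the text.

```python
def electron_injection_fraction(mu_e, mu_h, n_n, p_p, l_e=1.0, l_h=1.0):
    """Fraction of the junction current carried by electrons injected into the p side.

    mu_e, mu_h : electron and hole mobilities (same units as each other)
    n_n, p_p   : majority carrier concentrations on the n and p sides
    l_e, l_h   : minority carrier diffusion lengths (same units as each other)
    """
    return 1.0 / (1.0 + (mu_h * l_e * p_p) / (mu_e * l_h * n_n))


if __name__ == "__main__":
    mu_e, mu_h = 0.85, 0.04  # assumed GaAs-like mobilities, m^2 V^-1 s^-1
    for n_n in (1e23, 1e24, 1e25):
        eta = electron_injection_fraction(mu_e, mu_h, n_n=n_n, p_p=1e23)
        print(f"n_n = {n_n:.0e} m^-3  ->  eta_e = {eta:.4f}")
```

With equal doping the mobility ratio alone already pushes the fraction well above one half, and heavier n-side doping moves it closer to unity.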

Although the internal quantum efficiencies of some LED materials can approach 100%,
the external efficiencies are much lower. The main reason for this is that most of the emitted
radiation strikes the material interface at an angle greater than the critical angle and so
remains trapped. Unfortunately, the high refractive indices of the III-V materials discussed here
give rise to small critical angles. Consider, for example, radiation from a point source within a
medium of refractive index $n_1$ impinging on a plane interface with another medium of
refractive index $n_2$, where $n_2 < n_1$, as shown in Fig. 4.17. Only those rays (e.g. beam 1)
that have an angle to the normal less than the critical angle ($\theta_c$) enter the second
medium. Those with angles greater than $\theta_c$ (e.g. beam 3) are reflected back into the first
medium.

From eq. (1.14), we have that the critical angle $\theta_c$ is given by

$$\theta_c = \sin^{-1}(n_2/n_1) \tag{4.13}$$

Light originating at recombination centres near the p-n junction will be radiated isotropically,
whereas only that within a cone of semiangle $\theta_c$ will escape. In Problem 4.1, it is shown
that the fraction $F$ of the total generated radiation that is actually transmitted into the
second medium is

$$F = \frac{1}{4}\left(\frac{n_2}{n_1}\right)^2\left[1 - \left(\frac{n_1 - n_2}{n_1 + n_2}\right)^2\right] \tag{4.14}$$

FIG. 4.18 Two methods used to reduce reflection losses in LEDs. In (a) the p material is made
into a hemispherical dome. More radiation then strikes the semiconductor/air interface at less
than the critical angle than would otherwise be the case. In (b) the p-n junction is surrounded
by plastic encapsulation. Losses at the plane semiconductor/plastic interface are then less than
for a corresponding semiconductor/air interface.

There are two obvious ways to increase $F$: the first is to ensure that most rays strike the
surface at less than the critical angle. This may be achieved by shaping the semiconductor/air
interface into a hemisphere, as shown in Fig. 4.18(a). However, although this technique is
used occasionally in high power diodes, it is too difficult and expensive for most situations.
The second, and much commoner, technique is to encapsulate the junction in a transparent
medium of high refractive index. This is usually a plastic material with a refractive index
of about 1.5. Using eq. (4.14) with $n_1 = 3.6$ and $n_2 = 1.5$ we obtain $F = 0.036$, giving a
nearly threefold increase in light output over the simple semiconductor/air interface
(Example 4.2). Of course there will be some losses at the plastic/air interface, but these are
easily minimized by moulding the plastic into an approximately hemispherical shape (see
Fig. 4.18b).

The diodes themselves may be fabricated using either vapour or liquid phase epitaxy;
for more details the reader is referred to ref. 4.8.



EXAMPLE 4.2 Transmission efficiency of a plane GaAs LED surface

If we take a GaAs/air interface where $n_1 = 3.6$ and $n_2 = 1$, then the fractional
transmission for isotropic radiation originating inside the GaAs is given by eq. (4.14) as

$$F = \frac{1}{4}\left(\frac{1}{3.6}\right)^2\left[1 - \left(\frac{2.6}{4.6}\right)^2\right] = 0.013$$

The basic reason for this low transmission efficiency for a GaAs/air interface may be
better appreciated if we calculate the critical angle. From eq. (4.13) we have
$\theta_c = \sin^{-1}(1/3.6) = 16°$.
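
The numbers in this example, and the encapsulated case quoted earlier, can be reproduced with a few lines of code. The sketch below assumes the transmitted fraction takes the form $F = \frac{1}{4}(n_2/n_1)^2[1 - ((n_1 - n_2)/(n_1 + n_2))^2]$, which is consistent with both quoted values (0.013 and 0.036); the function names are illustrative.

```python
import math


def critical_angle_deg(n1, n2):
    """Critical angle (degrees) for light passing from index n1 into a lower index n2."""
    return math.degrees(math.asin(n2 / n1))


def escape_fraction(n1, n2):
    """Fraction of isotropically emitted light transmitted through a plane interface."""
    cone_term = 0.25 * (n2 / n1) ** 2                      # small-angle escape cone
    fresnel_term = 1.0 - ((n1 - n2) / (n1 + n2)) ** 2      # normal-incidence transmission
    return cone_term * fresnel_term


if __name__ == "__main__":
    f_air = escape_fraction(3.6, 1.0)
    f_plastic = escape_fraction(3.6, 1.5)
    print(f"critical angle (GaAs/air): {critical_angle_deg(3.6, 1.0):.1f} deg")
    print(f"F (GaAs/air)     = {f_air:.3f}")
    print(f"F (GaAs/plastic) = {f_plastic:.3f}, gain = {f_plastic / f_air:.2f}x")
```

The gain from encapsulation comes from both factors: the escape cone widens as $(n_2/n_1)^2$ and the Fresnel reflection at the interface falls as the index step is reduced.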



4.6.5 Response times of LEDs 

For most display purposes the fast response time obtainable from LEDs (< 1 μs) is not
essential. However, because of their importance in optical communications (see Chapter 9), it is
convenient to pursue the matter here. There are two main factors which limit the speed with
which an LED can respond to changes in the drive current. One of these is due to the effects
of junction capacitance $C_j$ which arises from the variation in charge stored in the depletion
region when the external voltage is varied. It was shown in section 2.8.4 that $C_j$ varies as
the reciprocal of the square root of the applied voltage (assuming an 'abrupt' junction).

The other limitation is due to what is sometimes termed 'diffusion capacitance', and results
from the storage of mobile carriers within a diffusion length or so of the junction. When the
external voltage is reduced, charge must diffuse away from the junction and disappear by
recombination to enable the new equilibrium conditions to be attained. It is shown in
Appendix 3 that the frequency response resulting from this process may be written

$$R(f) = \frac{R(0)}{(1 + 4\pi^2 f^2 \tau_e^2)^{1/2}} \tag{4.15}$$

Here $R(f)$ is the response at frequency $f$ and $\tau_e$ is the minority carrier lifetime
provided we have conditions of low level injection. For high level injection the concept of a
constant lifetime no longer applies (see the arguments leading to eq. 2.38), and we must assume
some average value of $\tau_e$. However, in practice eq. (4.15) is found to be a good
representation of the frequency response of most LEDs. It is evident that for a good frequency
response we require $\tau_e$ to be as small as possible. From eq. (2.38) we have for p-type
material that

$$\tau_e = (Bp)^{-1} \tag{4.16}$$

where $p$ is the majority carrier population (here assumed to be holes) and $B$ is the constant
tabulated in Table 4.1. We see that $\tau_e$ may be reduced by using highly doped material.
Unfortunately if compounds such as GaAs are doped at near the solubility limit for acceptor
impurities, then non-radiative centres are formed. Germanium is a fairly commonly used
acceptor impurity in GaAs and above a concentration of some $10^{24}$ atoms m$^{-3}$ the
external quantum efficiency starts to decline. At this concentration, the electron lifetime is
given by eq. (4.16):

$$\tau_e = (7 \times 10^{-16} \times 10^{24})^{-1} = 1.4 \times 10^{-9}\ \mathrm{s}$$
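
To see what such a lifetime implies for modulation bandwidth, the sketch below evaluates $\tau_e = (Bp)^{-1}$ and the frequency at which the response falls to $1/\sqrt{2}$ of its low-frequency value. It assumes a single-pole response of the form $R(f) = R(0)/(1 + 4\pi^2 f^2\tau_e^2)^{1/2}$; the function names are illustrative, and the value of $B$ is the one used in the numerical estimate above.

```python
import math


def minority_lifetime(b_coeff, majority_density):
    """Radiative minority carrier lifetime, 1/(B p), in seconds (SI inputs)."""
    return 1.0 / (b_coeff * majority_density)


def relative_response(freq_hz, lifetime_s):
    """R(f)/R(0) for the assumed single-pole response."""
    return 1.0 / math.sqrt(1.0 + (2.0 * math.pi * freq_hz * lifetime_s) ** 2)


def bandwidth_3db(lifetime_s):
    """Frequency at which R(f)/R(0) = 1/sqrt(2), i.e. 2*pi*f*tau = 1."""
    return 1.0 / (2.0 * math.pi * lifetime_s)


if __name__ == "__main__":
    tau = minority_lifetime(7e-16, 1e24)   # B in m^3 s^-1, p in m^-3
    f3 = bandwidth_3db(tau)
    print(f"tau_e = {tau:.2e} s, 3 dB bandwidth = {f3 / 1e6:.0f} MHz")
```

A lifetime of about a nanosecond therefore corresponds to a bandwidth of the order of 100 MHz under this model.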

An alternative approach to attaining short response times is to use lightly doped material
with a narrow active region and to operate the diode under conditions of heavy forward
injection. The injected electron and hole densities are then much greater than they are in
equilibrium. If $\Delta n$ and $\Delta p$ are the injected carrier concentrations of electrons
and holes, then eq. (4.16) can be written

$$\tau_e = (B\,\Delta p)^{-1} \tag{4.17}$$

If the injection current density is $J$ and the active region width is $t$, then in equilibrium
the number of recombinations per second per unit volume must be $J/te$. Since an excess
population density $\Delta p$ gives rise to $\Delta p/\tau_e$ recombinations per second per unit
volume, we have

$$\Delta p = \frac{J\tau_e}{et} \tag{4.18}$$

Eliminating $\Delta p$ from eqs (4.17) and (4.18), we obtain

$$\tau_e = \left(\frac{et}{BJ}\right)^{1/2} \tag{4.19}$$

We see that in this case $\tau_e$ may be reduced by reducing $t$ and increasing $J$. However,
since the lifetime is now current dependent, this approach can lead to signal distortion.

4.6.6 LED drive circuitry 

FIG. 4.19 Simple LED drive circuits for (a) d.c. operation and (b) a.c. operation. In both
cases a series resistor limits the maximum current flow; in the a.c. circuit a diode is placed
with reversed polarity across the LED to prevent damage from excessive reverse bias voltages;
(c) shows the light output obtained

As we have seen, the electrical characteristics of LEDs are essentially those of ordinary
rectifying diodes. Typical operating currents are between 20 mA and 100 mA, whilst the forward
voltages vary from 1.2 V for GaAs to 2 V for GaP. (The operating voltage is approximately
equal to the built-in diode potential, which in turn is slightly less than the energy gap
expressed in eV.) Simple drive circuits for d.c. and a.c. voltage operation are shown in Figs
4.19(a) and (b) respectively. The current through the diode is limited by a series resistance
$R_s$ whose value may be calculated from



$$R_s = \frac{V_b - V_d}{i_d} \qquad (4.20)$$



where $V_b$ is the power source voltage, $V_d$ the diode operating voltage and $i_d$ the desired
diode current. In the a.c. circuit, a rectifying diode is placed across the LED to protect it
against reverse bias breakdown.
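
As a quick illustration of eq. (4.20), the following sketch computes the series resistance
and the power it dissipates. The 5 V supply is an assumed value; the 1.2 V forward voltage
and 20 mA current are taken from the typical figures quoted above, and the function names are
hypothetical.

```python
def series_resistance(v_supply, v_diode, i_diode):
    """Eq. (4.20): R_s = (V_b - V_d) / i_d, in ohms."""
    if i_diode <= 0:
        raise ValueError("diode current must be positive")
    if v_supply <= v_diode:
        raise ValueError("supply voltage must exceed the diode voltage")
    return (v_supply - v_diode) / i_diode


def resistor_power(v_supply, v_diode, i_diode):
    """Power dissipated in the series resistor, in watts."""
    return (v_supply - v_diode) * i_diode


if __name__ == "__main__":
    # Assumed 5 V supply driving a GaAs LED (1.2 V) at 20 mA
    print(series_resistance(5.0, 1.2, 20e-3))   # about 190 ohms
    print(resistor_power(5.0, 1.2, 20e-3))      # about 0.076 W
```

In practice the next larger standard resistor value would normally be chosen, so that the
diode current stays at or below the design figure.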

These two circuits provide for continuous 'on' operation. If it is desired to switch the diode 
on or off, or to modulate the output, then the circuits shown in Figs 4.20(a) and (b) respec- 
tively may be used. In Fig. 4.20(a) the transistor is used as a simple switch. With no voltage 
applied to the base, the transistor has a very high impedance between the collector and emitter 
and hence no current flows through the LED. If a large enough base voltage is then applied 
so that the emitter-base junction becomes heavily forward biased, the transistor has a rel- 
atively low impedance between emitter and collector and a substantial current can flow, 
resulting in the LED being turned on. In Fig. 4.20(b) the transistor is biased so that the qui- 
escent diode current is about half its peak value and both the transistor and the LED are biased 
well into their linear regions. Changes in the current flowing through the LED are then 
directly proportional to changes in the input voltage. 

FIG. 4.20 LED modulation circuits: (a) provides for a simple on/off modulation via a switch.
The voltage applied to the base is sufficient to switch the transistor on. There is then a low
impedance path between the collector and emitter terminals, and the current flowing through
the LED is determined by the voltage $V$ and the series resistance $R_s$. In (b) the diode
output may be modulated by voltage $V_m$. The resistors $R_1$ and $R_2$ bias the transistor so
that the average current through the transistor, and hence through the LED, is about half the
maximum value. Both the transistor and the LED are then biased well into their linear regions.



4.7 Plasma displays



Plasma displays rely on the glow produced when an electrical discharge takes place in a gas
(usually neon). Free electrons present in the discharge acquire high kinetic energies from
the external field and when they collide with the gas atoms (or ions) they transfer this energy
to the atoms, thereby exciting them into energy levels above the ground state. The atom may
then lose energy radiatively and return to the ground state.

FIG. 4.21 Construction of an a.c. plasma element. The gas cavity is some $10^{-4}$ m in width
with transparent electrodes on the outside of the containing dielectric layers. The main
constituent of the gas is

FIG. 4.22 Voltage waveform used for driving an a.c. plasma display. After an initial firing
pulse $V_f$ the voltage may be reduced to $V_s$ for the remaining time the display is required
to be on.

Either a.c. or d.c. excitation may be used, although the former is the more common. Figure
4.21 shows the basic construction of an a.c. plasma display element in which the electrodes
are external to the gas cavity. (When d.c. excitation is used, the electrodes must be inside
the gas cavity.) Typical cavity widths and gas pressures are 100 µm and 400 torr respectively.
The discharge is initiated by applying a firing voltage $V_f$ of some 150 V; however, once the
discharge has started, it may be sustained with a reduced voltage $V_s$ of about 90 V. A
suitable voltage waveform for this situation is shown in Fig. 4.22.

If the initial voltage pulse is relatively wide then an appreciable amount of charge
accumulates on the interior of the dielectric layer. This charge build-up causes an internal
electric field to be set up which, being in opposition to the external field, may be strong
enough to extinguish the discharge if the pulse width is too long. Such accumulated charge
takes a long time to dissipate and this effect can be used to enable the device to have a
memory. Thus, if at some time subsequent to the first ('writing') pulse an external voltage
pulse of opposite polarity is applied, the field from the stored charge will add to the
external field and the discharge will again commence, but now at a lower voltage than $V_f$.



4.8 Display brightness

The basic photometric units of brightness were introduced in section 1.4, and we briefly
review the topic again here. The sensitivity of the human eye to different wavelengths was
shown in Fig. 1.24, which enables us to convert objective radiometric units into subjective
photometric units. Thus, if the radiant power in the wavelength interval $\lambda$ to
$\lambda + d\lambda$ (nm) is $W(\lambda)\,d\lambda$ (watts), then the total luminous flux
$\Phi$ (lumens) is given by

$$\Phi = 680 \int_{380}^{780} W(\lambda)\, V_\lambda \, d\lambda \qquad (4.21)$$

where $V_\lambda$ is the relative response of the eye as shown in Fig. 1.17. This function has
its maximum at a wavelength of 555 nm, where the eye's absolute sensitivity is 680 lm W$^{-1}$.

A calculation of $\Phi$ enables the conversion efficiency (in lm W$^{-1}$) of electrical power
into effective visible radiation to be determined. However, the apparent brightness of a
display element depends on the emission area and the angle from which it is observed. If the
element is sufficiently small for it to appear as a point source (which is the case if it
subtends an angle at the eye of some 2 minutes of arc or less), then a meaningful measure of
brightness is the flux per unit solid angle in the direction of the observer. This is called
the luminous intensity of a source and the units are lumens per steradian or candela. For
sources with an apparently finite area, the brightness is more properly referred to as the
luminance and the appropriate units are lumens per steradian per unit projected area of
emitting surface, that is candelas m$^{-2}$ or nits.

The variation in luminance with viewing angle is something that must be determined
experimentally for each type of display. It is often assumed that emitters show either
isotropic emission or Lambertian emission. If the luminance of a surface viewed at an angle
$\theta$ to the normal is $B(\theta)$, then for isotropic emission $B(\theta)$ = constant =
$B(0)$. For Lambertian emission, on the other hand, $B(\theta) = B(0)\cos\theta$. It is readily
shown (see Problem 4.5) that for an isotropic surface of area $A$ the total flux $\Phi_i$ is
given by

$$\Phi_i = 2\pi A B(0) \qquad (4.22)$$

whilst for a corresponding Lambertian surface

$$\Phi_l = \pi A B(0) \qquad (4.22a)$$

Quite a number of displays are Lambertian, but there are some, for example GaP red LEDs,
that approximate more to isotropic emitters.
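
The factors 2π and π in eqs (4.22) and (4.22a) can be checked numerically by integrating
the angular distribution over the forward hemisphere, where the solid-angle element is
2π sin θ dθ. The sketch below uses a simple midpoint rule; the function names and the step
count are illustrative assumptions.

```python
import math


def total_flux(b0, area, pattern, steps=20000):
    """Integrate B(0) * pattern(theta) * A over the forward hemisphere.

    Uses the midpoint rule with solid-angle element 2*pi*sin(theta) d(theta)
    for theta between 0 and pi/2.
    """
    d_theta = (math.pi / 2) / steps
    total = 0.0
    for k in range(steps):
        theta = (k + 0.5) * d_theta
        total += pattern(theta) * math.sin(theta) * d_theta
    return 2 * math.pi * area * b0 * total


def isotropic(theta):
    """B(theta) / B(0) for isotropic emission."""
    return 1.0


def lambertian(theta):
    """B(theta) / B(0) for Lambertian emission."""
    return math.cos(theta)


if __name__ == "__main__":
    print(total_flux(1.0, 1.0, isotropic) / math.pi)    # close to 2
    print(total_flux(1.0, 1.0, lambertian) / math.pi)   # close to 1
```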

EXAMPLE 4.3 Brightness of an LED

We take an LED with a chip diameter of 0.2 mm which is viewed from a distance of
1 m. It emits at a wavelength of 550 nm and has an external quantum efficiency of
0.1%. We further assume that the emission is isotropic and that the diode is operated
at 50 mA. We must first decide whether the diode acts as a point or as an extended
source at the eye. The emitting area subtends an angle $\theta$ where

$$\tan\frac{\theta}{2} = \frac{2 \times 10^{-4}}{2 \times 1}$$

whence $\theta < 1$ minute of arc, and the LED acts as a point source.
The total radiant power output $W$ is given by

$$W = \frac{hc}{\lambda} \times \text{quantum efficiency} \times \text{electron injection rate}
    = \frac{hc}{550 \times 10^{-9}} \times (0.001) \times \frac{50 \times 10^{-3}}{e}$$

Therefore $W = 1.13 \times 10^{-4}$ watts.

Using Fig. 1.24, we take an average luminosity at 550 nm of 600 lm W$^{-1}$; hence the
luminous flux from the source is $1.13 \times 10^{-4} \times 600$, that is $6.8 \times 10^{-2}$ lm.
Isotropic emission means the flux is uniformly distributed over the solid angle $2\pi$, and so
the luminous intensity at normal incidence is $6.8 \times 10^{-2}/2\pi = 1.1 \times 10^{-2}$ candela.
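
The arithmetic of Example 4.3 can be reproduced with a few lines of Python. This is only a
sketch: the function name and argument order are hypothetical, and the physical constants are
rounded values.

```python
import math

H = 6.626e-34          # Planck constant, J s
C = 2.998e8            # speed of light, m s^-1
E_CHARGE = 1.602e-19   # electronic charge, C


def led_point_source(wavelength, efficiency, current, lumens_per_watt,
                     diameter, distance):
    """Return (angle in arcmin, radiant power W, flux lm, intensity cd).

    Assumes isotropic emission into the forward hemisphere (solid angle 2*pi).
    """
    angle = 2 * math.atan(diameter / (2 * distance))
    arcmin = math.degrees(angle) * 60
    radiant_power = (H * C / wavelength) * efficiency * (current / E_CHARGE)
    flux = radiant_power * lumens_per_watt
    intensity = flux / (2 * math.pi)
    return arcmin, radiant_power, flux, intensity


if __name__ == "__main__":
    # Values from Example 4.3
    arcmin, power, flux, intensity = led_point_source(
        550e-9, 0.001, 50e-3, 600.0, 0.2e-3, 1.0)
    print(arcmin)      # about 0.69 minutes of arc, so a point source
    print(power)       # about 1.13e-4 W
    print(flux)        # about 6.8e-2 lm
    print(intensity)   # about 1.1e-2 cd
```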

Another brightness unit sometimes encountered is the foot-Lambert (ft-L). This is obtained
by dividing the total luminous flux (in lumens) by the area of the device in square feet. This
definition assumes a Lambertian emission pattern. To convert nits or candelas m$^{-2}$ into
foot-Lamberts (for a Lambertian surface) we multiply by the factor 0.292 (this factor may be
derived from eq. 4.22a).
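
The conversion factor follows directly from eq. (4.22a): a Lambertian surface of luminance
B(0) emits πB(0) lumens per square metre, and one square foot is 0.3048^2 square metres. A
minimal sketch (the function name is an assumption):

```python
import math

FOOT = 0.3048  # metres per foot (exact by definition)


def nits_to_foot_lamberts(nits):
    """Lumens per square foot from a Lambertian surface of the given luminance."""
    return math.pi * nits * FOOT ** 2


if __name__ == "__main__":
    print(nits_to_foot_lamberts(1.0))     # about 0.292
    print(nits_to_foot_lamberts(1000.0))  # about 292
```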

The brightness level required of a display will depend on the ambient lighting conditions. 
In a dimly lit room, a brightness of about 10 nits may suffice, whilst in a well-lit environ- 
ment about 1000 nits may be needed, although the difference between the display on and 
off brightnesses also plays an important role. A more thorough discussion is given in ref. 4.9. 



4.9 Liquid crystal displays

We turn now to the most important of the 'passive' types of display (and indeed the only 
one of this type we shall mention), namely liquid crystal displays (LCDs). These have come 
into prominence in the last few years mainly as display elements for digital watches and 
pocket calculators. Here one of the prime requisites is for low power consumption, partic- 
ularly so for the digital watch because of the necessarily low capacity of the power source. 
LCDs consume the least power of all common display devices because no light generation 
is required. This, as we have seen in the LED, for example (Table 4.2), is a very inefficient 
process; at best the efficiency is only some 10%. There are two basic types of LCD
available. These are (a) reflective, which requires front illumination, and (b) transmissive,
which requires rear illumination. Most reflective types utilize ambient light for illumination
with provision for secondary illumination via a small incandescent lamp or LED if ambient
levels become too low. At the heart of all LCD devices is a cell formed between two glass
plates, each with a conductive coating. The cell has a thickness of about 10 µm (sometimes
less) and is filled with a liquid crystal material.

The liquid crystal state is a phase of matter which is exhibited by a large number of organic
materials over a restricted temperature range. At the lower end of the temperature range, the
material becomes a crystalline solid, whilst at the upper end it changes into a clear liquid.
Within this range it has a milky yellowish appearance and combines some of the optical
properties of solids with the fluidity of liquids. A major characteristic of all liquid crystal
compounds is the rod-like shape of their molecules. When they are in the liquid crystal phase,
these molecules can take up certain orientations relative both to each other and to the liquid
crystal surface. It is usual to describe this orientation in terms of a director, that is a
unit vector pointing along the time-averaged preferred orientation of the molecules in any
small volume.

There are three basic types of ordering in liquid crystals, which are termed nematic,
cholesteric and smectic. Only the first two of these are of importance in display devices at
present and are illustrated in Fig. 4.23. In nematic ordering, the molecules (or, rather, the
directors) are aligned parallel to each other, but apart from remaining parallel the molecules
are free to move relative to each other so that the phase has liquid properties. A nematic
liquid crystal molecule usually consists of two benzene rings linked with a central group. A
typical example is 4-methoxybenzylidene-4-butylaniline (MBBA), which has the chemical formula
CH3-O-C6H4-CH=N-C6H4-C4H9. MBBA shows liquid crystal behaviour over the temperature range
20°C to 47°C. For more general details concerning liquid crystal materials the reader is
referred to ref. 4.10.

In the cholesteric phase (Fig. 4.23b), we may regard the material as being made up from
a large number of planes each having a nematic-like structure, but with each plane showing
a progressive change in the director direction from the one below. The director directions
thus display a helical twist through the material. The distance between planes having the
same director direction is called the pitch, $p$. Cholesteric liquid crystals exhibit some
interesting colour effects. If, for example, light of wavelength $\lambda$ is incident normally
on the director planes, then strong Bragg reflection will occur when $p = m\lambda$ ($m$ an
integer), but not otherwise. Thus if white light is shone onto a cholesteric liquid crystal, it
can appear strongly coloured. Furthermore, the pitch is usually temperature dependent, so that
the colour of the reflected light will also be temperature dependent. Obviously this can form
the basis of a thermometer. Most liquid crystal displays, however, are of the so-called twisted
nematic type, and we shall spend the rest of this section discussing them.
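
To see how the reflection condition selects a colour, the sketch below lists the wavelengths
p/m that fall inside the visible band for a given pitch. It applies the condition exactly as
stated above; the 400 nm to 700 nm band limits, the example pitch and the function name are
assumptions made for illustration.

```python
def reflected_wavelengths(pitch_nm, lo_nm=400.0, hi_nm=700.0):
    """Return (m, wavelength) pairs with wavelength = pitch / m inside the band."""
    result = []
    m = 1
    # Wavelengths decrease as m increases, so stop once below the band
    while pitch_nm / m >= lo_nm:
        wavelength = pitch_nm / m
        if wavelength <= hi_nm:
            result.append((m, wavelength))
        m += 1
    return result


if __name__ == "__main__":
    # Assumed pitch of 1100 nm: only the m = 2 reflection at 550 nm is visible
    print(reflected_wavelengths(1100.0))
```

A temperature-induced change in pitch shifts the selected wavelength in proportion, which is
the basis of the thermometer application mentioned above.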

When a nematic liquid crystal material comes into contact with a solid surface, the
directors often become aligned either perpendicular to the surface (homeotropic ordering) or
parallel to the surface (homogeneous ordering). These two forms can be produced by suitable
treatment of the surface. In the case of homogeneous ordering, this can often be achieved
by rubbing the surface once or twice along a particular direction with a soft fabric (e.g.
cotton) before it comes into contact with the liquid crystal material. The liquid crystal
directors then take up an orientation parallel to the direction of rubbing.

FIG. 4.23 (a) Nematic ordering and (b) cholesteric ordering. In (a) the directors all line up
parallel to each other. In (b) a large number of planes of nematic order are formed where the
directors rotate as we move along a direction perpendicular to the planes.



One of the most important electrical characteristics of liquid crystal materials is that they
show different dielectric constants $\varepsilon_\parallel$ and $\varepsilon_\perp$, depending
on whether the external field is parallel to, or perpendicular to, the molecular axis. If
$\varepsilon_\parallel > \varepsilon_\perp$ we refer to it as a positive material. The
application of an external electric field to a positive material will tend to make the
molecules lie along the electric field, since this will tend to minimize their energy. We see
that there is thus a possibility of changing the homogeneous type of ordering into a
homeotropic type by the application of a field which is perpendicular to the surface (assuming
a positive material). This transition is found to take place above a critical field
($\mathcal{E}_c$) and is illustrated in Fig. 4.24.

The most common liquid crystal display uses a 'twisted nematic' cell. In this, the opposite
walls of the cell are treated to produce a homogeneous arrangement in which the molecular
alignment directions at the walls are at right angles to each other. Thus the molecules
undergo a 90° rotation across the cell as shown in Fig. 4.25(a). When a beam of polarized
light is incident on the cell the strong optical anisotropy of the liquid causes the
polarization to undergo a 90° rotation. With a strong enough electric field across the cell,
however (i.e. $\mathcal{E} > \mathcal{E}_c$), the molecular alignments will become as shown in
Fig. 4.25(b) and in this state the molecular alignments will have no effect on an incident
polarized light beam.

FIG. 4.24 Behaviour of molecules in an initially homogeneously ordered liquid crystal material
as an increasing electric field of magnitude $\mathcal{E}$ is applied in a direction
perpendicular to the liquid crystal/solid interface. If $\mathcal{E}$ is less than a critical
value ($\mathcal{E}_c$) the ordering is not affected (a). If $\mathcal{E} > \mathcal{E}_c$ the
molecules furthest away from the interface begin to align along the field direction (b). If
$\mathcal{E} \gg \mathcal{E}_c$, then most of the molecules are aligned along the field
direction (c).

FIG. 4.25 Behaviour of the molecules in a liquid crystal cell of thickness $D$ with (a) no
applied voltage ($\mathcal{E} = 0$) and (b) with a voltage applied such that
$\mathcal{E} \gg \mathcal{E}_c$. In the former the molecules undergo a 90° rotation across the
cell, in the latter they are mostly ordered with their axes parallel to the applied field.

FIG. 4.26 Illustration of the action of an LCD display device. (Components labelled in the
figure: polaroid, LCD cell, polaroid, mirror.)

In operation, the cell is sandwiched between two pieces of polaroid whose polarizing direc- 
tions correspond to the director ordering direction of the particular cell surfaces they are next 
to. In the reflective mode a reflector is placed behind the back sheet of polaroid. Figure 4.26 
shows the arrangement and traces the behaviour of a polarized beam as it traverses the system. 
With no applied voltage, the incident light is first polarized, then has its polarization direc- 
tion rotated by 90° as it traverses the cell, then passes through the second polarizer and is then 
reflected back along its path where the same process is repeated. With no field applied,
therefore, the device reflects incident radiation and appears bright. When a field is applied
the direction of polarization of light traversing the cell is not rotated and hence cannot
pass through the second polarizer. Little light will then be reflected from the device and it
will appear dark.
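
The bright and dark states described above can be mimicked with a very idealized model based
on Malus's law. The sketch assumes perfect crossed polarizers, an ideal 90° rotation with no
field and none with the field on, and a lossless mirror that leaves the polarization direction
unchanged; all names are hypothetical and real cells depart from each of these assumptions.

```python
import math


def malus(intensity, light_angle, polarizer_angle):
    """Malus's law: intensity passed by an ideal linear polarizer."""
    return intensity * math.cos(light_angle - polarizer_angle) ** 2


def reflected_fraction(field_on):
    """Fraction of the light leaving the front polarizer that returns through it."""
    front, back = 0.0, math.pi / 2              # crossed polarizer axes
    twist = 0.0 if field_on else math.pi / 2    # rotation imposed by the cell

    angle, intensity = front, 1.0               # light just after the front polarizer
    angle += twist                              # forward pass through the cell
    intensity = malus(intensity, angle, back)   # back polarizer
    angle = back                                # transmitted light is polarized along 'back'
    angle -= twist                              # return pass (after the mirror) undoes the twist
    return malus(intensity, angle, front)       # front polarizer on the way out


if __name__ == "__main__":
    print(reflected_fraction(field_on=False))   # essentially 1: element appears bright
    print(reflected_fraction(field_on=True))    # essentially 0: element appears dark
```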

The amount of light reflected from an LCD as a function of applied voltage is shown
schematically in Fig. 4.27. The reflectance, initially constant, falls rapidly beyond a
critical voltage $V_c$ ($= \mathcal{E}_c D$; $D$ = cell thickness) and again becomes constant
beyond a voltage $V_{sat}$. A typical value for $V_{sat}$ is 3 V. D.C. operation tends to
shorten the operating lifetime of the device owing to electrochemical reactions taking place,
and hence a.c. waveforms are invariably used. The cell responds to the r.m.s. value of the
voltage waveform. A square waveform which has a frequency of between 25 Hz and 1 kHz is often
used.

FIG. 4.27 Amount of light reflected from a liquid crystal display as a function of applied
voltage $V$. The reflected irradiance remains constant up to the critical voltage $V_c$; it
then falls with increasing voltage until it again becomes constant beyond $V_{sat}$.

Transmission LCD displays do not have the reflector, and must be provided with rear
illumination, but otherwise they operate in a very similar fashion to the reflective displays.
Colour displays are possible by incorporating a colour filter. The use of polarizers in the
twisted nematic cell substantially reduces the maximum amount of light that can be reflected
from it (see Problem 4.7). In addition, the angle of viewing is found to be restricted to about
±45°. A greater image contrast over a wider range of viewing angles can be obtained by
increasing the angle of twist to 270° to give the so-called 'supertwist' display. The light
reflection curve shows a much more rapid switch from on to off states (i.e. the corresponding
$V_c$ and $V_{sat}$ values are much closer together than for the 90° twist curve (Fig. 4.27)).
A disadvantage of this is that the switching times are longer than for the 90° twist and the
cell width has to be reduced to compensate.



4.10 Numeric displays



We turn now to the problem of combining our individual display elements into some pattern
capable of conveying more information than just a simple 'on' or 'off' situation. There is
no doubt that for medium area, high resolution displays there is as yet no ready alternative
to CRTs. These do, however, have the obvious disadvantages of a small display area to
volume ratio, of requiring a high voltage supply and of being sensitive to adverse
environmental conditions (e.g. stray magnetic fields).

Displays that require only a small number of basic elements are easily catered for by using
LED, liquid crystal or plasma display elements. One of the simplest display formats commonly
used to form the numbers 0 to 9 consists of seven bar segments and is illustrated in
Fig. 4.28(a). Each bar might itself consist of several discrete display elements depending
on its size. More complex characters may be obtained using a 7 x 5 matrix (Fig. 4.28b). A
display of this latter type can be made either by assembling 35 discrete elements or, in the
case of LEDs, a monolithic device can be constructed with all the elements grown onto a
single substrate. This latter alternative is only feasible for smaller displays where the
characters are less than 5 mm or so in size.

FIG. 4.28 Two common display formats: (a) illustrates the seven-bar segment display used to
form the numbers from 0 to 9; the 7 x 5 matrix display (b) may be used for more complex
characters.
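
The seven-bar format can be captured in a small lookup table. The sketch below uses the
conventional segment lettering a to g, which is an assumption here since the text does not
name the segments; the function name is likewise hypothetical.

```python
# Conventional segment labels (an assumption; the text does not name them):
#   a = top, b = upper right, c = lower right, d = bottom,
#   e = lower left, f = upper left, g = middle
SEGMENTS = {
    0: "abcdef",
    1: "bc",
    2: "abdeg",
    3: "abcdg",
    4: "bcfg",
    5: "acdfg",
    6: "acdefg",
    7: "abc",
    8: "abcdefg",
    9: "abcdfg",
}


def render(digit):
    """Return a three-line text sketch of the lit segments for one digit."""
    s = SEGMENTS[digit]
    top = " " + ("_" if "a" in s else " ") + " "
    mid = ("|" if "f" in s else " ") + ("_" if "g" in s else " ") + ("|" if "b" in s else " ")
    bot = ("|" if "e" in s else " ") + ("_" if "d" in s else " ") + ("|" if "c" in s else " ")
    return "\n".join([top, mid, bot])


if __name__ == "__main__":
    for d in range(10):
        print(render(d))
```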

There are two basic methods of wiring up a matrix display. The simpler of these is known
as the common anode (or cathode) and is shown in Fig. 4.29(a). The anodes (or cathodes)
are all wired together whilst each cathode (or anode) has a separate connection made to it.
Thus for $N$ elements the number of external connections is $N + 1$. A technique requiring
fewer connections is the coordinate-connected (or matrix-connected) method shown in
Fig. 4.29(b). In this, all the anodes of the elements in each column (row) are connected
together as are all the cathodes of the elements in each row (column). For a square array of
$N$ elements we thus require $2N^{1/2}$ external connections. If $N$ is large, this number is
considerably less than that required for the common anode method. It is obvious, however, that
a general display pattern cannot be maintained merely by applying static voltages to the
leads. The array must be scanned. Thus if we label the columns (anodes) by $x_1$, $x_2$, $x_3$,
etc., and the rows (cathodes) by $y_1$, $y_2$, $y_3$, etc., then a possible scanning sequence
is as follows: a voltage $+V$ is applied to column $x_1$ and zero volts to the rest. Voltages
of either $-V$ or $+V$ are then applied simultaneously to the rows $y_1$, $y_2$, $y_3$, etc.,
the choice depending on whether we require the element at the intersection of column $x_1$ and
the particular row to be on or off. After a time $t_c$ the voltage $+V$ is switched to column
$x_2$ and a new set of voltages applied to the rows.

FIG. 4.29 (a) Common anode (or cathode) method of connecting an array of display elements. For
$N$ elements there will be $N + 1$ connections. (b) The coordinate-connected method of wiring
up an array of display elements. The columns are labelled $x_1$, $x_2$, $x_3$, etc., and the
rows $y_1$, $y_2$, $y_3$, etc.

If the number of columns is $N_c$ then the picture will be scanned in a time $N_c t_c$. To
avoid flicker, we must have $N_c t_c \le 1/45$ s and hence $t_c \le 1/(45 N_c)$ s. If $N_c$ is
very large, $t_c$ may approach the response time of the device which will result in a reduced
device output. Because each element in the display is on for at most a fraction $1/N_c$ of the
total 'on' time of the display then the active display elements must be overdriven by a factor
$N_c$ to maintain the same average brightness levels possible from an equivalent common anode
display. This places a limitation on the size of $N_c$ unless a much reduced brightness is
acceptable.
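
The trade-off between lead count and scanning speed can be put into numbers. In the sketch
below a square array of N elements is taken to have N^(1/2) column leads plus N^(1/2) row
leads; the function names and the example sizes are assumptions, while the 45 Hz flicker limit
is the figure quoted above.

```python
import math


def connections_common_anode(n_elements):
    """One lead per element plus the shared anode (or cathode)."""
    return n_elements + 1


def connections_coordinate(n_elements):
    """Square array: sqrt(N) column leads plus sqrt(N) row leads."""
    side = math.isqrt(n_elements)
    if side * side != n_elements:
        raise ValueError("n_elements must be a perfect square")
    return 2 * side


def max_column_dwell(n_columns, refresh_hz=45.0):
    """Longest time per column, in seconds, with the whole scan inside 1/refresh_hz."""
    return 1.0 / (refresh_hz * n_columns)


if __name__ == "__main__":
    n = 10000                               # assumed 100 x 100 array
    print(connections_common_anode(n))      # 10001 leads
    print(connections_coordinate(n))        # 200 leads
    print(max_column_dwell(100))            # about 2.2e-4 s per column
```

With 100 columns each element is driven for at most 1/100 of the time, so it must be
overdriven by a factor of 100 to match the average brightness of a statically driven display.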

It will be noticed that even when an element is 'off' it still may have a voltage $V$ across
it. An 'ideal' element should thus show no output up to a threshold voltage $V_{th}$ and then
give its maximum saturated output at a voltage $V_{sat}$, where $V_{sat} = 2 \times V_{th}$
(see Fig. 4.30). Several devices do have characteristics that are not too far removed from
this ideal, for example the LED and d.c. electroluminescent cells. Often, however, $V_{sat}$ is
greater than $2V_{th}$ and so it is not possible to drive the device to saturation, thus
reducing the maximum brightness available. In addition, $V_{th}$ may vary with operating
temperature and a 'safe' value somewhat below the actual value may have to be adopted, again
leading to a reduced output.
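
The origin of the voltage V across 'off' elements can be seen by tabulating the voltage on
every element while one column is addressed. The sketch follows the scanning scheme described
earlier in this section (selected column at +V, other columns at zero, rows at -V for 'on' and
+V for 'off'); the function name and the 2 x 2 test pattern are illustrative assumptions.

```python
def element_voltages(pattern, selected_column, v):
    """Voltage (column minus row) across each element during one column period.

    pattern[row][col] is True where the element should be lit.
    """
    n_rows, n_cols = len(pattern), len(pattern[0])
    col_v = [v if c == selected_column else 0.0 for c in range(n_cols)]
    row_v = [-v if pattern[r][selected_column] else v for r in range(n_rows)]
    return [[col_v[c] - row_v[r] for c in range(n_cols)] for r in range(n_rows)]


if __name__ == "__main__":
    pattern = [[True, False],
               [False, True]]
    # Addressing column 0 with V = 1: lit element sees 2V, unlit element in the
    # same column sees 0, elements in the unselected column see +V or -V
    print(element_voltages(pattern, 0, 1.0))   # [[2.0, 1.0], [0.0, -1.0]]
```

An element therefore has to remain dark at V and reach full output at 2V, which is the
idealized characteristic described above.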

Although LCDs have the attraction of requiring very little power, they have so far proved 
rather unsuitable for matrix displays. They have rather ill-defined threshold voltages which 
are quite strongly dependent on the operating temperature. In addition, their slow response 
times prevent them from responding fully to short voltage pulses. Nevertheless it is poss- 
ible that new materials will be developed which will in part overcome some of these diffi- 
culties. (For example, the positive cholesteric materials mentioned in the previous section 
have a faster response time than the more usual nematic types.) 



FIG. 4.30 Idealized characteristic (optical output against drive voltage) for a device
suitable for matrix-addressed displays. The device shows no optical output up to a threshold
drive voltage $V_{th}$. Thereafter the output increases to a maximum at a drive voltage of
$V_{sat}$.

FIG. 4.31 Method used for LCD matrix operation using an MOS transistor. The element may be
held 'on' (or 'off') during the time between addressing by virtue of the charge held on the
drain terminal. (The LCD consumes minimal power.) To address an element the gate of the
attached MOST is 'opened' by applying a suitable potential to the appropriate y line. The
voltage that is on the corresponding x line then appears across the device. Note: only one
connection to each device is shown here; there will be a further connection common to all
elements.

It is possible, when using an LCD, to hold each element on for a time longer than the
element address time. This may be achieved using the circuitry shown in Fig. 4.31. Here
each element is connected to the drain of an MOS transistor (see Note 1) whose source and gate
are connected to the appropriate column and row respectively. If the element is to be 'on' then
the voltage on the corresponding column is switched onto the element by applying a voltage
pulse to the appropriate row. Since the LCD element consumes very little current, the drain
capacitance of the MOST may enable the element to remain 'on' during the rest of the scan
time when the MOST is switched 'off'. There is, however, the obvious difficulty of reliably
fabricating the required number of MOS transistors if $N$ is large.



NOTES 



1. For those unfamiliar with the characteristics of an MOS transistor, further details may
be found in any of the many books available on solid state devices, for example refs 2.1a
and d.

PROBLEMS

4.1 Assuming an isotropic radiation source within a transparent medium of refractive index
$n_1$, show that the fraction $F$ of the radiation that can escape through a plane boundary
into another medium of refractive index $n_2$ ($n_2 < n_1$) is given approximately by

$$F \approx \frac{1}{4}\left(\frac{n_2}{n_1}\right)^2
  \left[1 - \left(\frac{n_1 - n_2}{n_1 + n_2}\right)^2\right]$$

(Hint: assume that when radiation is incident on the boundary at an angle $\theta$ (less than
the critical angle), then the transmittance $T(\theta)$ is identical with that obtained at
normal incidence, i.e. $T(\theta) = T(0) = 1 - [(n_1 - n_2)/(n_1 + n_2)]^2$.)

4.2 A GaAs LED fabricated from fairly lightly doped materials has an effective recombination
region of width 0.1 µm. If it is operated at a current density of $2 \times 10^7$ A m$^{-2}$,
estimate the modulation bandwidth that can be expected.

4.3 Design a simple power supply using a 9 V battery to drive an LED which has a maximum 
forward current of 20 mA at 1.9 V. 

4.4 Estimate the average luminance of a CRT screen of area 0.05 m$^2$ when it is operated
with a beam current of 1 µA and an accelerating potential of 15 kV. Assume the
phosphor used has a power efficiency of 10% and peak emission at a wavelength of
500 nm.

4.5 A light source of area $A$ has a luminance at normal incidence of $B(0)$ nits. Calculate
the total flux from the source if (a) it is isotropic and (b) it is Lambertian.

4.6 An isotropic source is embedded in a transparent medium of refractive index $n_1$. Show
that when the radiation passes into a second medium of refractive index $n_2$, then the
source appears to be approximately Lambertian. You may make the same assumptions
concerning the transmission of radiation through the interface as are made in Problem
4.1.

4.7 Unpolarized light is incident normally onto an LCD cell; if there is no applied voltage 
what is the maximum fraction of the incident light that can be reflected back? 

CHAPTER 5

Lasers I 



The word 'laser' is an acronym for 'light amplification by stimulated emission of radiation'.
Albert Einstein in 1917 showed that the process of stimulated emission must exist, but it 
was not until 1960 that T. H. Maiman (ref. 5.1) first achieved laser action at optical fre- 
quencies in ruby. The basic principles and construction of a laser are relatively straightfor- 
ward and it is somewhat surprising that the invention of the laser was so long delayed. A 
rigorous analysis of the physics of the laser, on the other hand, is quite difficult and the treat- 
ment we give below is somewhat simplified. The development of lasers since 1960 has been 
extremely rapid and although applications for lasers had a very slow start during their first 
decade, new applications for laser radiation are being found now almost every day (see 
section 6.5); a selection of texts on laser theory and applications is given in ref. 5.2. In view
of the increased use of lasers, and as laser radiation is potentially hazardous, some comments 
on laser safety have been included as Appendix 7. 



5.1 Emission and absorption of radiation

It was seen in Chapter 1 that when an electron in an atom undergoes transitions between
two energy states or levels it either emits or absorbs a photon, which can be described in
terms of a wave of frequency $\nu$ where $\nu = \Delta E/h$, $\Delta E$ being the energy
difference between the two levels concerned. Let us consider the electron transitions which
may occur between the two energy levels of the hypothetical atomic system shown in Fig. 5.1.
If the electron is in the lower level $E_1$, then in the presence of photons of energy
$(E_2 - E_1)$ it may be excited to the upper level $E_2$ by absorbing a photon. Alternatively
if the electron is in the level $E_2$ it may return to the ground state with the emission of a
photon. The emission process may occur in two distinct ways. These are (a) the spontaneous
emission process in which the electron drops to the lower level in an entirely random way and
(b) the stimulated emission process in which the electron is 'triggered' to undergo the
transition by the presence of photons of energy $(E_2 - E_1)$. There is nothing mystical in
this, as the electron would undergo this process sooner or later spontaneously; the transition
is simply initiated by the presence of the stimulating photon.

FIG. 5.1 Two-energy-level system.

FIG. 5.2 Energy level diagram illustrating (a) absorption, (b) spontaneous emission and (c)
stimulated emission. The black dot indicates the state of the atom before and after the
transition.

The absorption and emission processes are illustrated in Figs 5.2(a), (b) and (c). Under 
normal circumstances we do not observe the stimulated process because the probability of 
the spontaneous process occurring is much higher. The average time the electron exists in 
the excited state before making a spontaneous transition is called the lifetime $\tau_{21}$ of the
excited state. The '21' here indicates the energy levels involved. The probability that a particular
atom will undergo spontaneous emission within a time interval $dt$ is given by
$A_{21}\,dt = dt/\tau_{21}$, where $A_{21}$ is the spontaneous transition rate. Because the spontaneous radiation
from any atom is emitted at random, the radiation emitted by a large number of atoms
will clearly be incoherent. In contrast to this, the stimulated emission process results in 
coherent radiation since the waves associated with the stimulating and stimulated photons 
have identical frequencies (but see section 5.7), are in phase, have the same state of polar- 
ization and travel in the same direction. This means that with stimulated emission the ampli- 
tude of an incident wave can grow as it passes through a collection of excited atoms in what 
is clearly an amplification process. As the absorption transition, in common with stimu- 
lated emission, can only occur in the presence of photons of appropriate energy, it is often 
referred to as stimulated absorption. These two processes may be regarded as the inverse 
of one another. 

The above discussion ignores the fact that the emission and absorption processes do not 
simply involve photons of a precisely defined energy. Consequently there is a range of fre- 
quencies associated with these processes. The distribution of energies or frequencies in a 
given transition is described by the lineshape function g(v), which is discussed in section 5.7. 



5.2 Einstein relations

Einstein (ref. 5.3) showed that the parameters describing the above three processes are related
through the requirement that for a system in thermal equilibrium the rate of upward transitions
(from $E_1$ to $E_2$) must equal the rate of the downward transition processes (from $E_2$ to
$E_1$). Let us suppose that our simple two-level atomic energy level system is in equilibrium
inside a blackbody cavity, where, as we saw in section 1.4, the radiation covers a very wide
frequency range. Although we indicated at the end of the previous section that the upward 
and downward transitions involve a spread of frequencies, we may assume that this will be 
very small compared with that of a blackbody. Nevertheless in further considering the tran- 
sitions we must take into account the behaviour of photons within these frequency distri- 
butions. This analysis is undertaken in Appendix 4; meanwhile we adopt a simpler, 
approximate approach. 

If there are $N_1$ atoms per unit volume in the collection with energy $E_1$, then the upward
transition or absorption rate will be proportional to both $N_1$ and to the number of photons
available at the correct frequency. Now $\rho_\nu$, the energy density at frequency $\nu$, is given by
$\rho_\nu = \mathcal{N}_\nu h\nu$, where $\mathcal{N}_\nu$ is the number of photons per unit volume having frequency $\nu$. Therefore
we may write the upward transition rate as $N_1\rho_\nu B_{12}$, where $B_{12}$ is a constant. Similarly, if there
are $N_2$ atoms per unit volume in the collection with energy $E_2$ then the induced transition
rate from level 2 to level 1 is $N_2\rho_\nu B_{21}$, where again $B_{21}$ is a constant. The spontaneous transition
rate from level 2 to level 1 is simply $N_2A_{21}$. The total downward transition rate is the
sum of the induced and spontaneous contributions, that is

$$N_2\rho_\nu B_{21} + N_2A_{21}$$

$A_{21}$, $B_{21}$ and $B_{12}$ are called the Einstein coefficients; the relationships between them can be
established as follows.

For a system in equilibrium, the upward and downward transition rates must be equal and
hence we have

$$N_1\rho_\nu B_{12} = N_2\rho_\nu B_{21} + N_2A_{21} \tag{5.1}$$

Thus

$$\rho_\nu = \frac{N_2A_{21}}{N_1B_{12} - N_2B_{21}}$$

or

$$\rho_\nu = \frac{A_{21}/B_{21}}{(B_{12}/B_{21})(N_1/N_2) - 1} \tag{5.2}$$


Now, the populations of the various energy levels of a system in thermal equilibrium are
given by Boltzmann statistics to be

$$N_j = \frac{g_jN_0\exp(-E_j/kT)}{\sum_i g_i\exp(-E_i/kT)} \tag{5.3}$$

where $N_j$ is the population density of the energy level $E_j$, $N_0$ is the total population density
and $g_j$ is the degeneracy of the $j$th level. Hence

$$\frac{N_1}{N_2} = \frac{g_1}{g_2}\exp[(E_2 - E_1)/kT] = \frac{g_1}{g_2}\exp(h\nu/kT) \tag{5.4}$$

Therefore, substituting eq. (5.4) into eq. (5.2) gives

$$\rho_\nu = \frac{A_{21}/B_{21}}{[(g_1/g_2)(B_{12}/B_{21})\exp(h\nu/kT)] - 1} \tag{5.5}$$




Since the collection of atoms in the system we are considering is in thermal equilibrium it
must give rise to radiation which is identical with blackbody radiation, eq. (1.42), the radiation
density of which can be described by (see ref. 1.1c)

$$\rho_\nu = \frac{8\pi h\nu^3n^3}{c^3}\left(\frac{1}{\exp(h\nu/kT) - 1}\right) \tag{5.6}$$

where $n$ is the refractive index of the medium.
Comparing eqs (5.5) and (5.6) for $\rho_\nu$, we see that

$$g_1B_{12} = g_2B_{21} \tag{5.7}$$

and

$$\frac{A_{21}}{B_{21}} = \frac{8\pi h\nu^3n^3}{c^3} \tag{5.8}$$



Equations (5.7) and (5.8) are referred to as the Einstein relations. The second relation enables
us to evaluate the ratio of the rate of spontaneous emission to the rate of stimulated emission
for a given pair of energy levels. We see that this ratio is given by

$$R = \frac{A_{21}}{\rho_\nu B_{21}} \tag{5.9}$$

or, using eqs (5.6) and (5.8),

$$R = \exp(h\nu/kT) - 1 \tag{5.9a}$$

EXAMPLE 5.1 Ratio of rates of spontaneous and stimulated emission

Let us calculate this ratio for a tungsten filament lamp operating at a temperature of
2000 K. Taking the average frequency to be $5 \times 10^{14}$ Hz we see from eq. (5.9a) that
the ratio is

$$R = \exp\left(\frac{6.6 \times 10^{-34} \times 5 \times 10^{14}}{1.38 \times 10^{-23} \times 2000}\right) - 1 \approx \exp(12) \approx 1.5 \times 10^{5}$$

This confirms that under conditions of thermal equilibrium, stimulated emission is not
an important process. For sources operating at lower temperatures and higher frequencies,
stimulated emission is even less likely.
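
The arithmetic of Example 5.1 can be checked with a few lines of Python. This is only an illustrative sketch: the function name and the rounded physical constants are assumptions made here, not part of the text, and the result depends slightly on how the constants are rounded.

```python
import math

H = 6.626e-34    # Planck constant in J s (rounded value, assumed here)
K_B = 1.381e-23  # Boltzmann constant in J/K (rounded value, assumed here)


def spontaneous_to_stimulated_ratio(frequency_hz, temperature_k):
    """Ratio R = exp(h*nu/kT) - 1 of spontaneous to stimulated emission rates
    for a system in thermal equilibrium, following eq. (5.9a)."""
    return math.expm1(H * frequency_hz / (K_B * temperature_k))


if __name__ == "__main__":
    # Tungsten lamp of Example 5.1: 2000 K, average frequency 5e14 Hz
    ratio = spontaneous_to_stimulated_ratio(5e14, 2000.0)
    print(f"R = {ratio:.2e}")
```

Evaluating the same function at a lower temperature or a higher frequency gives a larger ratio, in line with the closing remark of the example.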



The above discussion indicates that the process of stimulated emission competes with the 
processes of spontaneous emission and absorption. Clearly, if we wish to amplify a beam of 
light by stimulated emission then we must increase the rate of this process in relation to the 
other two processes. Consideration of eq. (5.1) indicates that to achieve this for a given pair 
of energy levels we must increase both the radiation density and the population density $N_2$
of the upper level in relation to the population density $N_1$ of the lower level. Indeed, we shall
show that to produce laser action we must create a condition in which $N_2 > (g_2/g_1)N_1$ even
though $E_2 > E_1$, that is we must create a so-called population inversion. Before describing this
situation in detail it will be instructive to look more closely at the process of absorption. 



5.3 Absorption of radiation

Let us consider a collimated beam of perfectly monochromatic radiation of unit cross-sec- 
tional area passing through an absorbing medium. We assume for simplicity that there is 
only one relevant electron transition, which occurs between the energy levels $E_1$ and $E_2$. Then
the change in irradiance of the beam as a function of distance is given by

$$\Delta I(x) = I(x + \Delta x) - I(x)$$

For a homogeneous medium $\Delta I(x)$ is proportional both to the distance travelled $\Delta x$ and to
$I(x)$. That is, $\Delta I(x) = -\alpha I(x)\Delta x$, where the constant of proportionality, $\alpha$, is the absorption
coefficient. The negative sign indicates the reduction in beam irradiance due to absorption,
as $\alpha$ is a positive quantity. Writing this expression as a differential equation we have

$$\frac{dI(x)}{dx} = -\alpha I(x)$$

Integrating this equation gives

$$I = I_0\exp(-\alpha x) \tag{5.10}$$

where $I_0$ is the incident irradiance.
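
As a rough numerical illustration of eq. (5.10), the sketch below evaluates the transmitted irradiance and the thickness over which the beam falls to half its incident value. The function names and the sample absorption coefficient are hypothetical, chosen only for illustration.

```python
import math


def transmitted_irradiance(i0, alpha, x):
    """Irradiance after a path length x in a medium of absorption
    coefficient alpha, I = I0 * exp(-alpha * x), eq. (5.10)."""
    return i0 * math.exp(-alpha * x)


def half_value_thickness(alpha):
    """Path length over which the irradiance halves: ln(2) / alpha."""
    return math.log(2.0) / alpha


if __name__ == "__main__":
    alpha = 50.0  # m^-1, an arbitrary illustrative value
    x_half = half_value_thickness(alpha)
    print(x_half, transmitted_irradiance(1.0, alpha, x_half))
```

A negative value of `alpha` in the same expression gives an irradiance that grows with distance, which is the situation that arises once a population inversion is present.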

Let us consider the absorption coefficient in more detail. Clearly, the degree of
absorption of the beam will depend on how many atoms $N_1$ there are with electrons in
the lower energy state $E_1$ and on how many atoms $N_2$ there are in energy state $E_2$. If
$N_2$ were zero, then absorption would be a maximum, while if all of the atoms were in
the upper state the absorption would be zero and the probability of stimulated emission
would be large.

From the discussion of induced or stimulated transitions given in section 5.2, we can write
an expression for the net rate of loss of photons per unit volume, $-d\mathcal{N}_\nu/dt$, from the beam
as it travels through a volume element of medium of thickness $\Delta x$ and unit cross-sectional
area (Fig. 5.3) as

$$-\frac{d\mathcal{N}_\nu}{dt} = N_1\rho_\nu B_{12} - N_2\rho_\nu B_{21}$$

or, substituting from eq. (5.7),

$$-\frac{d\mathcal{N}_\nu}{dt} = \left(\frac{g_2}{g_1}N_1 - N_2\right)\rho_\nu B_{21} \tag{5.11}$$




In this discussion, we have deliberately ignored photons generated by spontaneous emis- 
sion as these are emitted randomly in all directions and do not therefore contribute to the 
collimated beam. Similarly we have ignored scattering losses. 

We can now link eq. (5.11), which contains the difference in populations of the two energy
levels, to the absorption coefficient $\alpha$. We recall that the irradiance of the beam is the energy
crossing unit area in unit time and therefore is given by the energy density times the speed
of light in the medium, that is $I = \rho c/n$, or for photons of frequency $\nu$, $I_\nu = \rho_\nu c/n = \mathcal{N}_\nu h\nu c/n$,
where $c$ is the speed of light in vacuo and $n$ is the refractive index of the medium. Hence
the change in photon density within the beam between the boundaries $x$ and $x + \Delta x$ of the




FIG. 5.3 Radiation passing through a volume element of length $\Delta x$ and unit cross-sectional area.
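volume element can be written as

$$-d\mathcal{N}_\nu(x) = [I_\nu(x) - I_\nu(x + \Delta x)]\frac{n}{h\nu_{21}c}$$

If $\Delta x$ is sufficiently small we can rewrite this equation as

$$-d\mathcal{N}_\nu(x) = -\frac{dI_\nu(x)}{dx}\,\frac{\Delta x\,n}{h\nu_{21}c}$$

Thus the rate of decay of photon density in a time interval $dt$ ($= \Delta x/(c/n)$) is

$$\frac{d\mathcal{N}_\nu}{dt} = \frac{dI_\nu(x)}{dx}\,\frac{1}{h\nu_{21}}$$

and substituting for $dI/dx$ from eq. (5.10) gives

$$\frac{d\mathcal{N}_\nu}{dt} = -\alpha I_\nu(x)\,\frac{1}{h\nu_{21}} = -\alpha\rho_\nu\,\frac{c}{n}\,\frac{1}{h\nu_{21}} \tag{5.12}$$

Hence comparing eqs (5.11) and (5.12) we have

$$\alpha\rho_\nu\,\frac{c}{n}\,\frac{1}{h\nu_{21}} = \left(\frac{g_2}{g_1}N_1 - N_2\right)\rho_\nu B_{21}$$

Therefore the absorption coefficient $\alpha$ is given by

$$\alpha = \left(\frac{g_2}{g_1}N_1 - N_2\right)\frac{B_{21}h\nu_{21}n}{c} \tag{5.13}$$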

We see from eq. (5.13) that $\alpha$, as we supposed earlier, depends on the difference in the populations
of the two energy levels $E_1$ and $E_2$. For a collection of atoms in thermal equilibrium,
since $E_2 > E_1$, $(g_2/g_1)N_1$ will always be greater than $N_2$ (eq. 5.4) and hence $\alpha$ is positive.
If, however, we can create a situation, referred to in the previous section, in which $N_2$ becomes
greater than $(g_2/g_1)N_1$ then $\alpha$ is negative and the quantity $(-\alpha x)$ in the exponent of eq. (5.10)
becomes positive. Thus the irradiance of the beam grows as it propagates through the medium
in accordance with the equation

$$I = I_0\exp(kx) \tag{5.14}$$

where $k$ is referred to as the small signal gain coefficient and is given by

$$k = \left(N_2 - \frac{g_2}{g_1}N_1\right)\frac{B_{21}h\nu_{21}n}{c} \tag{5.15}$$



5.4 Population inversion

The population inversion condition required for light amplification is a non-equilibrium dis- 
tribution of atoms among the various energy levels of the atomic system. The Boltzmann 
FIG. 5.4 Populations of a two-level energy system: (a) in thermal equilibrium; (b) after a population inversion
has been produced.



distribution which applies to a system in thermal equilibrium is given by eq. (5.3) and is illustrated
in Fig. 5.4(a); $N_j$ is the population density of the $j$th energy level and clearly as $E_j$
increases $N_j$ decreases for a constant temperature. We note that if the energy difference
between $E_1$ and $E_2$ were nearly equal to $kT$ ($\approx 0.025$ eV at room temperature) then the population
of the upper level would be $\exp(-1)$ or 0.37 of that of the lower level. For an energy
difference large enough to give visible radiation ($\approx 2.0$ eV), however, the population of the
upper level is almost negligible as Example 5.2 shows.



EXAMPLE 5.2 Relative populations of energy levels 



We may estimate the relative populations in thermal equilibrium of two energy levels 
such that a transition from the higher to the lower will give visible radiation. 

Let us take the average wavelength of visible radiation as 550 nm; then 
$E_2 - E_1 = hc/(550 \times 10^{-9}) = 3.6 \times 10^{-19}$ J $= 2.25$ eV.

Assuming room temperature ($T = 300$ K) and that $g_1 = g_2$, we have from eq. (5.4)

$$\frac{N_2}{N_1} = \exp\left(\frac{-3.6 \times 10^{-19}}{1.38 \times 10^{-23} \times 300}\right) = \exp(-87) \approx 10^{-38}$$
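
The estimate in Example 5.2 can be reproduced numerically. The following is a minimal sketch in which the function name, the default degeneracies and the rounded constants are assumptions introduced for illustration.

```python
import math

H = 6.626e-34    # Planck constant in J s (rounded, assumed here)
C = 2.998e8      # speed of light in vacuo in m/s (rounded, assumed here)
K_B = 1.381e-23  # Boltzmann constant in J/K (rounded, assumed here)


def population_ratio(wavelength_m, temperature_k, g1=1.0, g2=1.0):
    """Equilibrium ratio N2/N1 from eq. (5.4) for two levels whose
    separation corresponds to a photon of the given vacuum wavelength."""
    delta_e = H * C / wavelength_m
    return (g2 / g1) * math.exp(-delta_e / (K_B * temperature_k))


if __name__ == "__main__":
    # Visible transition (550 nm) at room temperature, as in Example 5.2
    print(population_ratio(550e-9, 300.0))
```

Calling the function with a wavelength of `H * C / (K_B * 300.0)`, that is a level separation equal to $kT$, returns $\exp(-1) \approx 0.37$, the figure quoted in the text above.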



Clearly, then, if we are to create a population inversion, illustrated in Fig. 5.4(b), we must 
supply a large amount of energy to excite atoms into the upper level E 2 . This excitation 
process is called pumping and much of the technology of lasers is concerned with how the 
pumping energy can be supplied to a given laser system. Pumping produces a non-thermal
equilibrium situation; we shall now consider in general terms how pumping enables a pop- 
ulation inversion to be achieved. 

5.4.1 Attainment of a population inversion 

One of the methods used for pumping is stimulated absorption; that is, the energy levels which
one hopes to use for laser action are pumped by intense irradiation of the system. Now as
$B_{12}$ and $B_{21}$ are equal (assuming $g_1 = g_2$), once atoms are excited into the upper level the probabilities
of further stimulated absorption or emission are equal so that even with very intense
pumping the best that can be achieved with the two-level system, considered hitherto, is 
equality of the populations of the two levels. 
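
The saturation argument above can be made concrete with a simple steady-state rate equation. The sketch below assumes $g_1 = g_2$, a pump-induced transition rate per atom $W = \rho_\nu B_{12} = \rho_\nu B_{21}$, and a spontaneous lifetime $\tau_{21}$; setting $dN_2/dt = W(N_1 - N_2) - N_2/\tau_{21} = 0$ with $N_1 + N_2 = N$ gives $N_2/N = W/(2W + 1/\tau_{21})$. The function name and the sample values are hypothetical.

```python
def upper_level_fraction(pump_rate, lifetime):
    """Steady-state fraction N2/(N1 + N2) of atoms in the upper level of an
    optically pumped two-level system with equal degeneracies.

    pump_rate: stimulated transition rate per atom, W, in s^-1
    lifetime:  spontaneous lifetime of the upper level, tau_21, in s
    """
    return pump_rate / (2.0 * pump_rate + 1.0 / lifetime)


if __name__ == "__main__":
    tau = 1e-3  # s, an arbitrary illustrative lifetime
    for w in (1e2, 1e4, 1e6, 1e8):
        print(f"W = {w:.0e} s^-1  ->  N2/N = {upper_level_fraction(w, tau):.6f}")
```

However large the pump rate is made, the returned fraction approaches but never exceeds one half, so no inversion results in this two-level model.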

As a consequence we must look for materials with either three or four energy level systems;
this is not really a disadvantage as atomic systems generally have a large number of energy levels. 

The three-level system first proposed by Bloembergen (ref. 5.4) is illustrated in Fig. 5.5.
Initially the distribution obeys Boltzmann's law. If the collection of atoms is intensely illuminated
the electrons can be excited (i.e. pumped) into the level $E_2$ from the ground state
$E_0$. From $E_2$ the electrons decay by non-radiative processes to the level $E_1$ and a population
inversion may be created between $E_1$ and $E_0$. Ideally, the transition from level $E_2$ to $E_1$ should
be very rapid, thereby ensuring that there are always vacant states at $E_2$, while that from $E_1$
to $E_0$ should be very slow, that is $E_1$ should be a metastable state. This allows a large build-up
in the number of atoms in level $E_1$, as the probability of spontaneous emission is relatively
small. Eventually $N_1$ may become greater than $N_0$ and then population inversion will
have been achieved.
FIG. 5.5 Population of the energy levels by pumping in a three-level system: (a) Boltzmann distribution
and (b) distribution after pumping and the transitions involved.

The level $E_2$ should preferably consist of a large number of closely spaced levels so that
pumping uses as wide a part of the spectral range of the pumping radiation as possible, thereby 
increasing the pumping efficiency. Even so, three-level lasers, for example ruby, require very 
high pump powers because the terminal level of the laser transition is the ground state. This 
means that rather more than half of the ground state atoms (this number is usually very nearly 
equal to the total number of atoms in the collection) have to be pumped to the upper state 
to achieve a population inversion. 

The four-level system shown in Fig. 5.6 has much lower pumping requirements. If $(E_1 - E_0)$
is rather large compared with $kT$ (the thermal energy at the temperature of operation), then
the populations of the levels $E_1$, $E_2$ and $E_3$ are all very small in conditions of thermal equilibrium.
Thus, if atoms are pumped from the ground state to the level $E_3$, from which they
decay very rapidly to the metastable level $E_2$, a population inversion is quickly created
between levels $E_2$ and $E_1$.

Again the upper level $E_3$ should preferably consist of a large number of levels for greatest
pumping efficiency. If the lifetimes of the transitions $E_3 \to E_2$ and $E_1 \to E_0$ are short, the
population inversion between $E_2$ and $E_1$ can be maintained with moderate pumping and continuous
laser action can be achieved more readily. In the Nd:YAG laser, for example,
$\tau_{21} = 0.5$ ms while $\tau_{10} \approx 30$ ns and, although there are many upper levels used for pumping,
each has a lifetime of about $10^{-8}$ s (i.e. $\tau_{32} \approx 10^{-8}$ s). The details of the mechanisms used for
pumping lasers can be quite complicated and, in addition to optical pumping, pumping can 
occur in an electrical discharge or by electron bombardment, the release of chemical energy, 
the passage of a current, etc. The energy level schemes of the media used in lasers are often 
complex, but they can usually be approximated by either three- or four-level schemes. 
FIG. 5.6 Population of the energy levels in a four-level system: (a) before pumping and (b) after pumping.

5.5 Optical feedback

The laser, despite its name, is more analogous to an oscillator than an amplifier. In an elec- 
tronic oscillator, an amplifier tuned to a particular frequency is provided with positive feed- 
back and, when switched on, any electrical noise signal of the appropriate frequency 
appearing at the input will be amplified. The amplified output is fed back to the input and 
amplified yet again and so on. A stable output is quickly reached, however, since the ampli- 
fier saturates at high input voltages, as it cannot produce a larger output than the supply 
voltage. 

In the laser, positive feedback may be obtained by placing the gain medium between a 
pair of mirrors which, in fact, form an optical cavity (a Fabry-Perot resonator). The initial
stimulus is provided by any spontaneous transitions between appropriate energy levels in 
which the emitted photon travels along the axis of the system. The signal is amplified as it 
passes through the medium and 'fed back' by the mirrors. Saturation is reached when the 
gain provided by the medium exactly matches the losses incurred during a complete round 
trip. 

The gain per unit length of many active media is so small that very little amplification of 
a beam of light results from a single pass through the medium. In the multiple passes which 
a beam undergoes when the medium is placed within a cavity, however, the amplification 
may be substantial. 

We have tacitly assumed that the radiation within the cavity propagates to and fro between 
two plane-parallel mirrors in a well-collimated beam. Because of diffraction effects,
however, this cannot be the case as a perfectly collimated beam cannot be maintained with 
mirrors of finite extent; some radiation will spread out beyond the edges of the mirrors. 
Diffraction losses of this nature can be reduced by using concave mirrors. In practice a 
number of different mirror curvatures and configurations are used depending on the appli- 
cations envisaged and the type of laser being used. 

A detailed analysis of the effects of different mirror systems requires a rigorous applica- 
tion of diffraction theory and is beyond the scope of this book (see e.g. ref. 5.2a). Using simple 
ray tracing techniques, however, it is quite easy to anticipate the results of such an analysis 
in that mirror configurations which retain a ray of light, initially inclined at a small angle 
to the axis, within the optical cavity after several reflections are likely to be useful (see ref. 
5.5). Such cavities are said to be stable. 

The commonly used mirror configurations are shown in Fig. 5.7; they all have various 
advantages and disadvantages. The plane-parallel configuration, for example, is very difficult
to align, for if the mirrors are not strictly parallel (to within about 1 second of arc) the
optical beam will 'walk off' the mirrors after a few reflections. On the other hand, the radiation
beam makes maximum use of the laser medium (we say that it has a large mode volume
- see also section 5.9) as there is no focusing of the beam within the cavity. In addition, the
mirrors need to be flat to within $\lambda/100$. In contrast to the plane-parallel case, the confocal
arrangement is relatively easy to align (an accuracy of 1.5 minutes of arc is sufficient) but 
the use of the active medium is restricted (i.e. the mode volume is small). In gas lasers, if 
maximum power output is required we use a large radius resonator, while if uniphase oper- 
ation (i.e. maximum beam coherence) is required we use the hemispherical system. 
FIG. 5.7 Some commonly used laser cavity mirror configurations (the dashed lines show the extent of the
mode volume in each case).



Sometimes mirror configurations are used which give rise to unstable cavities. In these 
a ray which is initially travelling at a small angle of incidence to the cavity axis will diverge 
away from the axis after a number of reflections. Such resonators are characterized by high 
losses, but even so they have some useful properties. In particular they can make efficient 
use of the mode volume. As unstable resonators have large losses they can be used effec- 
tively only with high gain media such as carbon dioxide. As we mentioned earlier, the gain 
is usually very small so it is essential to minimize all losses in the laser (see section 5.6). 
One source of loss is absorption in the mirrors. To reduce this, high reflectance multilayer 
dielectric coatings on the mirrors are used rather than metallic coatings. In these so-called 
multilayer stacks there is a sequence of quarter-wave (i.e. $\lambda/4$) layers of alternate high and
low refractive index dielectric materials on a glass substrate. Because of the phase changes 
occurring at alternate interfaces, all the reflected waves are in phase and add constructively. 
More than 20 such layers may be needed to give reflectances in excess of 99.9% - lower 
reflectances require fewer layers. Clearly, the mirrors will only be effective over a narrow
wavelength range. A familiar example of this sort of process is the blooming of camera lenses
to reduce unwanted reflections. 

We can now derive the minimum pump power required (i.e. the threshold condition) in 
terms of the parameters of the whole system for laser oscillations to occur. 



5.6 Threshold conditions - laser losses

It was explained above that a steady state level of oscillation is reached when the rate of 
amplification is balanced by the rate of loss. This is the situation in continuous output (CW) 
lasers; it is a little different in pulse lasers. Thus, while a population inversion is a neces- 
sary condition for laser action, it is not a sufficient one because the minimum (i.e. thresh- 
old value) of the gain coefficient must be large enough to overcome the losses and sustain 
oscillations. The threshold gain, in turn, through eq. (5.15) specifies the minimum popula- 
tion inversion required. 

The total loss of the system is due to a number of different processes; the most important 
ones include: 

1. Transmission at the mirrors - the transmission from one of the mirrors usually provides
the useful output, the other mirror is made as reflective as possible to minimize losses. 

2. Absorption and scattering at the mirrors. 

3. Absorption in the laser medium due to transitions other than the desired transitions (as 
mentioned earlier most laser media have many energy levels, not all of which will be 
involved in the laser action). 

4. Scattering at optical inhomogeneities in the laser medium - this applies particularly to 
solid state lasers. 

5. Diffraction losses at the mirrors. 

To simplify matters, let us include all the losses except those due to transmission at the mirrors
in a single effective loss coefficient $\gamma$ which reduces the effective gain coefficient to $(k - \gamma)$.
We can determine the threshold gain by considering the change in irradiance of a beam of
light undergoing a round trip within the laser cavity. We assume that the laser medium fills
the space between the mirrors $M_1$ and $M_2$, which have reflectances $R_1$ and $R_2$ and a separation
$L$. Then in travelling from $M_1$ to $M_2$ the beam irradiance increases from $I_0$ to $I$ where,
from eq. (5.14),

$$I = I_0\exp[(k - \gamma)L]$$

After reflection at $M_2$, the beam irradiance will be $R_2I_0\exp[(k - \gamma)L]$ and after a complete
round trip the final irradiance will be such that the round trip gain $G$ is

$$G = \frac{\text{final irradiance}}{\text{initial irradiance}} = R_1R_2\exp[2(k - \gamma)L]$$

If $G$ is greater than unity a disturbance at the laser resonant frequency will undergo a net
amplification and the oscillations will grow; if $G$ is less than unity the oscillations will die
out. Therefore we can write the threshold condition as

$$G = R_1R_2\exp[2(k_{th} - \gamma)L] = 1 \tag{5.16}$$

where $k_{th}$ is the threshold gain. It is important to realize that the threshold gain is equal
to the steady state gain in continuous output lasers, that is $k_{th} = k_{ss}$. This equality is due to
a phenomenon known as gain saturation, which can be explained as follows. Initially, when 
laser action commences the gain may be well above the threshold value. The effect of stim- 
ulated emission, however, will be to reduce the population of the upper level of the laser 
transition so that the degree of population inversion and consequently the gain will 
decrease. Thus the net round trip gain may vary and be greater than or less than unity so 
that the cavity energy density will correspondingly increase or decrease. It is only when 
G has been equal to unity for a period of time that the cavity energy (and laser output 
power) settles down to a steady state value, that is when the gain just balances the losses 
in the medium. In terms of the population inversion there will be a threshold value 
$N_{th} = [N_2 - (g_2/g_1)N_1]_{th}$ corresponding to $k_{th}$. In steady state situations $[N_2 - (g_2/g_1)N_1]$
remains equal to $N_{th}$ regardless of the amount by which the threshold pumping rate is
exceeded (see section 5.8). The small signal gain required to support steady state operation
depends on the laser medium through $k$ and $\gamma$, and on the laser construction through
$R_1$, $R_2$ and $L$. From eq. (5.16) we can see that

$$k_{th} = \gamma + \frac{1}{2L}\ln\left(\frac{1}{R_1R_2}\right) \tag{5.17}$$

where the first term represents the volume losses and the second the loss in the form of the 
useful output. Equation (5.15) shows that k can have a wide range of values, depending not 
only on $[N_2 - (g_2/g_1)N_1]$ but also on the intrinsic properties of the active medium. If $k$ is high
then it is relatively easy to achieve laser action, mirror alignment is not critical and dust can 
be tolerated on the mirrors. With low gain media, on the other hand, such losses are unac- 
ceptable and the mirrors must have high reflectances, and be scrupulously clean and care- 
fully aligned. 
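
To get a feel for the threshold condition, eq. (5.17) can be evaluated for a given cavity and then checked against the round trip gain expression that leads to eq. (5.16). The sketch below uses hypothetical function names and arbitrary illustrative cavity parameters; it is not taken from the text.

```python
import math


def threshold_gain(gamma, r1, r2, length):
    """Threshold gain coefficient k_th = gamma + (1/2L) ln(1/(R1 R2)),
    eq. (5.17); gamma and the result are in m^-1, length in m."""
    return gamma + math.log(1.0 / (r1 * r2)) / (2.0 * length)


def round_trip_gain(k, gamma, r1, r2, length):
    """Round trip gain G = R1 R2 exp[2 (k - gamma) L]."""
    return r1 * r2 * math.exp(2.0 * (k - gamma) * length)


if __name__ == "__main__":
    # Illustrative low-gain cavity: 0.3 m long, mirrors of 99.9% and 98%
    gamma, r1, r2, length = 0.05, 0.999, 0.98, 0.3
    k_th = threshold_gain(gamma, r1, r2, length)
    print(k_th, round_trip_gain(k_th, gamma, r1, r2, length))
```

With perfectly reflecting mirrors the logarithmic term vanishes and the threshold gain reduces to the volume loss coefficient $\gamma$ alone.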

It should be noted that a laser with a high gain medium will not necessarily have a 
high efficiency. The efficiency is the ratio of the output light power to the input pumping 
power. It therefore depends on how effectively the pump power is converted into pro- 
ducing a population inversion, on the probabilities of different kinds of transitions from 
the upper level and on the losses in the system. With reference to Fig. 5.6(b), and confining
our attention to optical pumping, we can easily see that the efficiency cannot
exceed $(E_2 - E_1)/(E_3 - E_0) = \nu_{21}/\nu_{30}$ for the four-level system and that it will be considerably
less than this for the three-level system, where over half the atoms have to be
pumped out of the ground state before population inversion is produced. The actual effi- 
ciencies, as defined above, are usually very much less than this because of the energy 
loss in converting electrical energy into optical energy at the pump frequency and the 
fact that not all the atoms pumped into level 3 will necessarily make a transition to level 
2. Certain lasers (e.g. CO$_2$) are characterized by having a high efficiency and a high small
signal gain. Other lasers such as argon, although having a high gain, have a very low 
efficiency. 
5.7 Lineshape function

In deriving the expression for the small signal gain we assumed that all the atoms in either 
the upper or lower levels would be able to interact with the (perfectly) monochromatic beam. 
In fact this is not so; spectral lines have a finite wavelength (or frequency) spread, that is 
they have a spectral width. This can be seen in both emission and absorption and if, for 
example, we were to measure the transmission as a function of frequency for the transition 
between the two energy states $E_1$ and $E_2$, we would obtain the bell-shaped curve shown in
Fig. 5.8(a). 

The emission curve would be the inverse of this (see Fig. 5.8b). The shape of these curves 
is described by the lineshape function g(v), which can also be used to describe a frequency 
probability curve. Thus we may define g(v) dv as the probability that a given transition 
between the two energy levels will result in the emission (or absorption) of a photon whose 
frequency lies between $\nu$ and $\nu + d\nu$. $g(\nu)$ is normalized such that $\int_{-\infty}^{\infty} g(\nu)\,d\nu = 1$. Therefore
we see that a photon of energy $h\nu$ may not necessarily stimulate another photon of energy
$h\nu$. We then take $g(\nu)\,d\nu$ as the probability that the stimulated photon will have an energy
between hv and h(v + dv). 

It is shown in Appendix 4 that, when a monochromatic beam of frequency $\nu_s$ interacts with
a group of atoms with a lineshape function $g(\nu)$, the small signal gain coefficient may be
written as

$$k(\nu_s) = \left(N_2 - \frac{g_2}{g_1}N_1\right)\frac{B_{21}h\nu_sn\,g(\nu_s)}{c} \tag{5.18}$$



The form of the lineshape function g(v) depends on the particular mechanism responsible 



FIG. 5.8 (a) Transmission curve for transitions between energy levels $E_1$ and $E_2$ and (b) the emission curve for
transitions between $E_2$ and $E_1$. The precise form of these curves (the lineshape) depends on the spectral
broadening mechanisms.
for the spectral broadening in a given transition. The three most important mechanisms are
Doppler broadening, collision (or pressure) broadening and natural (or lifetime) damping, 
which are described briefly below; the interested reader is referred to ref. 5.6 for further 
details. 



EXAMPLE 5.3 Small signal gain coefficient 



It may be instructive to calculate the value of population inversion required to give a
gain coefficient of 1 m$^{-1}$ in a given laser. We take Nd:YAG for which we have the
following data: spontaneous lifetime, $\tau_{21} = 230$ μs; wavelength, $\lambda = 1.06$ μm; refractive
index, $n = 1.82$; and linewidth, $\Delta\nu = 3 \times 10^{12}$ Hz.

From eq. (5.8) we have $B_{21} = \lambda^3/(8\pi h\tau_{21}n^3) = 5.15 \times 10^{16}$ m$^3$ W$^{-1}$ s$^{-3}$. Therefore
from eq. (5.18) we have (with $k = 1$ m$^{-1}$ and $g(\nu_s) \approx 1/\Delta\nu$, see eq. 5.21)

$$N_2 - \frac{g_2}{g_1}N_1 = \frac{k\lambda\,\Delta\nu}{B_{21}hn} = 5.12 \times 10^{22}\ \text{m}^{-3}$$
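
The two steps of Example 5.3 can be verified numerically. The sketch below assumes the data quoted in the example, the approximation $g(\nu_s) \approx 1/\Delta\nu$, and a rounded value of Planck's constant; the function names are hypothetical.

```python
import math

H = 6.626e-34  # Planck constant in J s (rounded, assumed here)


def einstein_b21(wavelength_m, lifetime_s, n):
    """B21 from eq. (5.8) with A21 = 1/tau21 and nu = c/lambda:
    B21 = lambda^3 / (8 pi h tau21 n^3)."""
    return wavelength_m ** 3 / (8.0 * math.pi * H * lifetime_s * n ** 3)


def required_inversion(gain_per_m, wavelength_m, linewidth_hz, b21, n):
    """Population inversion N2 - (g2/g1) N1 from eq. (5.18), taking
    g(nu_s) ~ 1/linewidth and c/nu_s = lambda."""
    return gain_per_m * wavelength_m * linewidth_hz / (b21 * H * n)


if __name__ == "__main__":
    # Nd:YAG data of Example 5.3
    b21 = einstein_b21(1.06e-6, 230e-6, 1.82)
    inversion = required_inversion(1.0, 1.06e-6, 3e12, b21, 1.82)
    print(f"B21 = {b21:.3e}, inversion = {inversion:.3e} m^-3")
```

The results agree with the figures in the example to within the rounding of the constants.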



DOPPLER BROADENING 

We are familiar with the Doppler effect, which occurs because of the relative motion of a source
and observer. The frequency as measured by the observer increases if the source and observer
approach one another and decreases as they recede. This effect applies to a collection of atoms
emitting at an optical frequency $\nu_{12}$, so that the observed frequency is given by

$$\nu'_{12} = \nu_{12}\left(1 \pm \frac{v_x}{c}\right)$$

where $v_x$ is the component of the velocity of the atom along the direction of observation (we
assume $v_x \ll c$). Since the atoms are in random motion, an observer would measure a range
of frequencies depending on the magnitude and direction of $v_x$. That is, as far as the observer
is concerned the collection of atoms would be emitting at a range of different resonant
frequencies, resulting in a broadening of the emission lineshape. The individual Doppler-shifted
resonant frequencies contribute to the smooth Doppler-broadened lineshape.

The mean-squared velocity component $\overline{v_x^2}$ depends on the temperature according to
$\frac{1}{2}M\overline{v_x^2} = \frac{1}{2}kT$, where $M$ is the atomic mass, so that the halfwidth (full width of the curve
at half the maximum intensity of emission) of the curve is proportional to the square root
of $T$. Doppler broadening is the predominant mechanism in most gas lasers emitting in
the visible region. For example, the halfwidth of the 632.8 nm transition of the He-Ne
laser is about $2 \times 10^{-3}$ nm, assuming a temperature of operation of about 400 K (see
Problem 5.6).

Halfwidths are often expressed in terms of frequency: thus a halfwidth of $2 \times 10^{-3}$ nm
corresponds to a frequency halfwidth of 1500 MHz. (As $c = \nu\lambda$, we may write $d\nu = -(c/\lambda^2)\,d\lambda$
and hence $|d\nu| = (3 \times 10^8 \times 2 \times 10^{-12})/(632.8 \times 10^{-9})^2 = 1500$ MHz.)
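The conversion between wavelength and frequency halfwidths is easily scripted. In the sketch below the first function follows directly from $c = \nu\lambda$; the second uses the standard expression for the Doppler halfwidth of a gas, $\Delta\nu = 2\nu_0[2kT\ln 2/(Mc^2)]^{1/2}$, which is not derived in the text and is included here as an assumption for comparison. The neon atomic mass of 20.18 u and the rounded constants are also supplied by us.

```python
import math

C = 2.998e8       # speed of light, m/s
K_B = 1.381e-23   # Boltzmann constant, J/K
AMU = 1.661e-27   # atomic mass unit, kg

def wavelength_to_frequency_width(d_lambda, wavelength):
    """|d nu| = (c / lambda^2) |d lambda|, from c = nu * lambda."""
    return C * d_lambda / wavelength**2

def doppler_fwhm(wavelength, temperature, mass_amu):
    """Doppler halfwidth (FWHM) in Hz; standard formula, assumed here."""
    nu0 = C / wavelength
    return 2 * nu0 * math.sqrt(
        2 * K_B * temperature * math.log(2) / (mass_amu * AMU * C**2)
    )

# He-Ne 632.8 nm line: halfwidth quoted in the text, and a Doppler estimate at 400 K
dnu_from_text = wavelength_to_frequency_width(2e-12, 632.8e-9)
dnu_doppler = doppler_fwhm(632.8e-9, 400.0, 20.18)
print(f"{dnu_from_text / 1e6:.0f} MHz, {dnu_doppler / 1e6:.0f} MHz")
```

Both routes give a halfwidth close to the 1500 MHz quoted above.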
COLLISION BROADENING

The Doppler linewidth of molecular lasers such as the CO$_2$ laser is relatively small because
of their low resonant frequencies (in the infrared) and comparatively large molecular masses. 
In such lasers, collision broadening becomes important. Collision broadening also occurs 
in doped insulator lasers. In these lasers the ions of the active medium may suffer collisions 
with phonons, that is quantized lattice vibrations. 

If an atom which is emitting a photon suffers a collision, then the phase of the wave train 
associated with the photon is suddenly altered. This in effect shortens the emitted wave trains, 
which can be shown by Fourier techniques (ref. 5.7) to be equivalent to a broadening of the 
spectral line. Clearly, the higher the pressure (and temperature) of the gas the more frequently 
will the atoms suffer collisions and the greater will be the spectral broadening. 

NATURAL DAMPING 

It can be shown that the very act of an atomic electron emitting energy in the form of a photon 
leads to an exponential damping of the amplitude of the wave train. The effect of this is 
similar to collision broadening in that it effectively shortens the wave trains and produces 
a broadened spectral line. 

Broadening mechanisms can be classified into homogeneous and inhomogeneous broaden- 
ing. If all of the atoms of the collection have the same transition centre frequency and the 
same resonance lineshape then the broadening is termed homogeneous; such is the case for 
collision broadening. On the other hand, in some situations each atom has a slightly differ- 
ent resonance frequency or lineshape for the same transition. The observed lineshape is then 
the average of the individual ones, such as in Doppler broadening, and the mechanism is 
termed inhomogeneous. Local variations of temperature, pressure, applied magnetic field as 
well as local variations due to crystal imperfections also lead to inhomogeneous broadening 
of the emission or absorption lineshapes. The nature of the broadening is important in several
aspects of laser theory, for example in the discussion of gain saturation mentioned earlier.

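Homogeneous broadening mechanisms lead to a Lorentzian lineshape which may be
written as

$$g(\nu)_L = \frac{\Delta\nu/2\pi}{(\nu - \nu_0)^2 + (\Delta\nu/2)^2}$$

where $\Delta\nu$ is the linewidth, that is the separation between the two points on the (frequency)
curve where the function falls to half of its peak value, which occurs at frequency $\nu_0$. Putting
$\nu = \nu_0$ gives

$$g(\nu_0)_L = \frac{2}{\pi\,\Delta\nu} \qquad (5.19)$$

Inhomogeneous broadening mechanisms, on the other hand, lead for a gas to a Gaussian
frequency distribution, given by

$$g(\nu)_G = \frac{2}{\Delta\nu}\left(\frac{\ln 2}{\pi}\right)^{1/2}\exp\left[-\ln 2\left(\frac{\nu - \nu_0}{\Delta\nu/2}\right)^2\right]$$

and putting $\nu = \nu_0$ gives

$$g(\nu_0)_G = \frac{2}{\Delta\nu}\left(\frac{\ln 2}{\pi}\right)^{1/2} \qquad (5.20)$$

For the purpose of later calculations we may, in fact, approximate both eqs (5.19) and (5.20)
by

$$g(\nu_0) \approx \frac{1}{\Delta\nu} \qquad (5.21)$$

Hence at the frequency $\nu_s = \nu_0$ we may approximate $g(\nu_s)$ by $1/\Delta\nu$ in eq. (5.18) for the small
signal gain coefficient.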
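To see how rough the approximation of eq. (5.21) is, the two normalized lineshapes can be evaluated at line centre. The sketch below assumes the usual normalized Lorentzian and Gaussian forms with halfwidth $\Delta\nu$; the function names are our own.

```python
import math

def lorentzian(nu, nu0, dnu):
    """Normalized Lorentzian lineshape with halfwidth (FWHM) dnu."""
    return (dnu / (2 * math.pi)) / ((nu - nu0) ** 2 + (dnu / 2) ** 2)

def gaussian(nu, nu0, dnu):
    """Normalized Gaussian lineshape with halfwidth (FWHM) dnu."""
    amplitude = (2 / dnu) * math.sqrt(math.log(2) / math.pi)
    return amplitude * math.exp(-math.log(2) * ((nu - nu0) / (dnu / 2)) ** 2)

# Work in units of the linewidth, with the line centred on zero
nu0, dnu = 0.0, 1.0
peak_lorentz = lorentzian(nu0, nu0, dnu) * dnu   # equals 2/pi
peak_gauss = gaussian(nu0, nu0, dnu) * dnu       # equals 2*sqrt(ln2/pi)
print(f"g(nu0)*dnu: Lorentzian {peak_lorentz:.3f}, Gaussian {peak_gauss:.3f}")
```

The peak values are about 0.64 and 0.94 in units of $1/\Delta\nu$, so replacing either by $1/\Delta\nu$ is adequate for order-of-magnitude estimates.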

Because of these various broadening mechanisms, we can no longer treat a group of atoms
as though they all radiate at the same frequency. Instead, we must consider a small spread
of frequencies about some central value. It might then be expected that the output of the laser
would contain the same distribution of frequencies as the broadened transitions of the atoms
in the medium. This is, in fact, not the case as the spectral character of the laser output is
different from that of spontaneous emission in the same medium. Two factors account for
this difference: the effects of the optical resonator (discussed below) and the effect of the
amplification process on the irradiance. As light travels through an amplifying medium the
irradiance varies as

$$I_\nu(\nu, x) = I_\nu(\nu, 0)\exp[k(\nu)x]$$

Equation (5.18) shows that $k(\nu)$ depends on the lineshape function $g(\nu)$; hence $I_\nu(\nu, x)$ is
related exponentially to $g(\nu)$. Consequently the function $I_\nu(\nu, x)$ is much greater at the centre
and smaller in the 'wings' than the atomic lineshape. $I_\nu(\nu, x)$ is therefore narrower than the
atomic lineshape; this effect is known as spectral narrowing. In fact, as we shall see below,
laser light has an even narrower spectral range than suggested by this argument.
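Spectral narrowing can be illustrated numerically. The sketch below assumes a Gaussian atomic lineshape and an input spectrum $I_\nu(\nu, 0)$ proportional to that lineshape (as for spontaneous emission); both are assumptions made for illustration. It finds, by bisection, the halfwidth of the amplified spectrum relative to the atomic halfwidth for a given peak gain $k(\nu_0)x$.

```python
import math

LN2 = math.log(2)

def amplified_profile(u, peak_gain):
    """Amplified spectrum, normalised to 1 at line centre.

    u is the frequency offset in units of half the atomic halfwidth, so the
    Gaussian lineshape g(u) = exp(-ln2 * u**2) falls to 1/2 at u = 1.
    peak_gain is the exponent k(nu_0) * x at line centre.
    """
    g = math.exp(-LN2 * u * u)
    return g * math.exp(peak_gain * (g - 1.0))

def narrowing_ratio(peak_gain, steps=60):
    """Halfwidth of the amplified spectrum divided by the atomic halfwidth."""
    lo, hi = 0.0, 1.0   # profile is 1 at u = 0 and at most 1/2 at u = 1
    for _ in range(steps):
        mid = 0.5 * (lo + hi)
        if amplified_profile(mid, peak_gain) > 0.5:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

for gain in (0.0, 1.0, 5.0):
    print(gain, round(narrowing_ratio(gain), 3))
```

With no gain the ratio is unity, and it falls steadily as the peak gain increases, in line with the argument above.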



5.8 Population inversion and pumping threshold conditions

We may now use eq. (5.18) to calculate the population inversion required to reach the lasing
threshold. From eq. (5.18) we have

$$N_2 - \frac{g_2}{g_1}N_1 = \frac{k(\nu_s)\,c}{B_{21}h\nu_s n\,g(\nu_s)}$$

At threshold, the small signal gain coefficient is given by eq. (5.17), that is

$$k(\nu_s) = k_{th} = \gamma + \frac{1}{2L}\ln\left(\frac{1}{R_1R_2}\right)$$

and therefore

$$N_{th} = \left(N_2 - \frac{g_2}{g_1}N_1\right)_{th} = \frac{k_{th}\,c}{B_{21}h\nu_s n\,g(\nu_s)}$$

From eq. (5.8) we have $B_{21} = c^3A_{21}/(8\pi h\nu^3n^3)$. The quantity $A_{21}$ may be determined
experimentally by noting that it is the reciprocal of the spontaneous radiative lifetime $\tau_{21}$ from
level 2 to level 1.

Thus combining the above equations we can write the threshold population inversion as

$$N_{th} = \left(N_2 - \frac{g_2}{g_1}N_1\right)_{th} = \frac{8\pi\nu_s^2k_{th}\tau_{21}n^2}{c^2g(\nu_s)} \qquad (5.22)$$

We note that the lasing threshold will be achieved most readily when $g(\nu_s)$ is a maximum,
that is when $\nu_s$ has the value $\nu_0$ corresponding to the centre of the natural linewidth. We may
therefore replace $g(\nu_s)$ by $1/\Delta\nu$ (see eq. 5.21) to give

$$N_{th} = \frac{8\pi\nu_0^2k_{th}\tau_{21}\Delta\nu\,n^2}{c^2} \qquad (5.23)$$
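Equations (5.17) and (5.23) are easily evaluated together. The following sketch uses the Nd:YAG data of Example 5.3; the cavity length of 0.1 m, the mirror reflectances and the neglect of the distributed loss $\gamma$ are hypothetical values chosen only for illustration.

```python
import math

C = 2.998e8   # speed of light in vacuum, m/s (rounded)

def threshold_gain(gamma, length, r1, r2):
    """Threshold gain coefficient k_th from eq. (5.17)."""
    return gamma + math.log(1.0 / (r1 * r2)) / (2.0 * length)

def threshold_inversion(k_th, wavelength, tau21, linewidth, n):
    """Threshold population inversion N_th from eq. (5.23)."""
    nu0 = C / wavelength
    return 8 * math.pi * nu0**2 * k_th * tau21 * linewidth * n**2 / C**2

# Hypothetical cavity: 0.1 m long, one total reflector, one 90% mirror, gamma = 0
k_th = threshold_gain(0.0, 0.1, 1.0, 0.9)
n_th = threshold_inversion(k_th, 1.06e-6, 230e-6, 3e12, 1.82)
print(f"k_th = {k_th:.3f} m^-1, N_th = {n_th:.3g} m^-3")
```

Setting the gain coefficient to 1 m$^{-1}$ in `threshold_inversion` recovers the inversion found in Example 5.3, which is a useful consistency check on eq. (5.23).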



We now proceed to calculate the pumping power required to reach threshold. To do this,
we must solve the rate equations for the particular system. The rate equations describe the
rate of change of the populations of the laser medium energy levels in terms of the emission
and absorption processes and pump rate. We shall consider the ideal four-level system shown
in Fig. 5.9. We assume that $E_1 \gg kT$ so that the thermal population of level 1 is negligible; we
also assume that the threshold population density $N_{th}$ is very small compared with the ground
state population so that during lasing the latter is hardly affected. If we let $\mathcal{R}_2$ and $\mathcal{R}_1$ be the
rates at which atoms are pumped into levels 2 and 1 respectively, we can write the rate
equations for these levels (assuming $g_1 = g_2$ for simplicity, and hence $B_{12} = B_{21}$) as

$$\frac{dN_2}{dt} = \mathcal{R}_2 - N_2A_{21} - \rho_\nu B_{21}(N_2 - N_1) \qquad (5.24)$$

and

$$\frac{dN_1}{dt} = \mathcal{R}_1 + \rho_\nu B_{21}(N_2 - N_1) + N_2A_{21} - N_1A_{10} \qquad (5.25)$$



FIG. 5.9 Transitions within an ideal four-level system. (The principal pumping mechanism excites atoms
to the level $E_3$, from where they rapidly decay to the level $E_2$. As $E_2$ is the upper level of the laser
transition, we have designated the corresponding pumping rate $\mathcal{R}_2$ rather than $\mathcal{R}_3$.)
Process $\mathcal{R}_1$, which populates the lower laser level 1, is detrimental to laser action as it clearly
reduces the population inversion. Although such pumping is unavoidable in many lasers,
for example gas lasers pumped via an electrical discharge (section 5.10.3), we shall
henceforth ignore $\mathcal{R}_1$. If we assume that the system is being pumped at a steady rate then we have
$dN_2/dt = dN_1/dt = 0$. Hence we may solve eqs (5.24) and (5.25) for $N_1$ and $N_2$. We leave it
as an exercise for the reader to show that

$$N_1 = \frac{\mathcal{R}_2}{A_{10}}, \qquad N_2 = \frac{\mathcal{R}_2}{A_{21} + \rho_\nu B_{21}}\left(1 + \frac{\rho_\nu B_{21}}{A_{10}}\right)$$

and hence

$$N_2 - N_1 = \mathcal{R}_2\left(\frac{1 - A_{21}/A_{10}}{A_{21} + \rho_\nu B_{21}}\right) \qquad (5.26)$$

We can see that unless $A_{21} < A_{10}$, the numerator will be negative and no population
inversion can take place. As the Einstein $A$ coefficients are the reciprocals of the spontaneous
lifetimes, the condition $A_{21} < A_{10}$ is equivalent to the condition $\tau_{10} < \tau_{21}$, that is the upper lasing
level has a longer spontaneous emission lifetime than the lower level. In most lasers $\tau_{21} \gg \tau_{10}$
and $(1 - A_{21}/A_{10}) \approx 1$.
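The steady state result of eq. (5.26) can be checked numerically by solving eqs (5.24) and (5.25), with the time derivatives set to zero, as a pair of linear equations. The sketch below does this with Cramer's rule; the pump rate, the lifetime of the lower level and the stimulated rate `w` (standing for $\rho_\nu B_{21}$) are hypothetical values chosen for illustration.

```python
def solve_steady_state(r2, a21, a10, w, r1=0.0):
    """Solve eqs (5.24) and (5.25) with dN/dt = 0 by Cramer's rule.

    w stands for rho_nu * B21. Returns (n1, n2).
    """
    # With unknowns ordered (n2, n1) the two equations read
    #   -(a21 + w) * n2 +        w  * n1 = -r2      (eq. 5.24)
    #    (a21 + w) * n2 - (w + a10) * n1 = -r1      (eq. 5.25)
    a, b, c, d = -(a21 + w), w, (a21 + w), -(w + a10)
    det = a * d - b * c
    n2 = (-r2 * d - b * (-r1)) / det
    n1 = (a * (-r1) - c * (-r2)) / det
    return n1, n2

def inversion_eq_5_26(r2, a21, a10, w):
    """Population inversion N2 - N1 from eq. (5.26), with R1 ignored."""
    return r2 * (1.0 - a21 / a10) / (a21 + w)

# Hypothetical numbers: tau21 = 230 us, tau10 = 0.1 us, pump rate 1e26 m^-3 s^-1
a21, a10, r2, w = 1.0 / 230e-6, 1.0e7, 1.0e26, 2.0e3
n1, n2 = solve_steady_state(r2, a21, a10, w)
print(n2 - n1, inversion_eq_5_26(r2, a21, a10, w))
```

The two printed values agree, and the lower level population is simply $\mathcal{R}_2/A_{10}$, as expected when every atom pumped into level 2 must eventually leave through level 1.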

Now, below threshold we may neglect $\rho_\nu$ since lasing has not yet commenced and most
of the pump power appears as spontaneous emission; thus eq. (5.26) can be written as

$$N_2 - N_1 = \mathcal{R}_2\left(\frac{1 - A_{21}/A_{10}}{A_{21}}\right)$$

That is, there is a linear increase in population inversion with pumping rate but insufficient
inversion to give amplification.

At threshold, $\rho_\nu$ is still small and assuming $g_1 = g_2$ we can express the threshold
population inversion in terms of the threshold pump rate, that is

$$(N_2 - N_1)_{th} = N_{th} = \mathcal{R}_{th}\left(\frac{1 - A_{21}/A_{10}}{A_{21}}\right) \qquad (5.27)$$

or, inserting the above approximation that $(1 - A_{21}/A_{10}) \approx 1$,

$$N_{th} = \frac{\mathcal{R}_{th}}{A_{21}}$$

or

$$\mathcal{R}_{th} = N_{th}A_{21} = \frac{N_{th}}{\tau_{21}}$$

Each atom raised into level 2 requires an amount of energy $E_3$ so that the total pumping power
per unit volume $P_{th}$ required at threshold may be written as

$$P_{th} = E_3\mathcal{R}_{th} = \frac{E_3N_{th}}{\tau_{21}}$$

We may substitute for $N_{th}$ from eq. (5.23) to give

$$P_{th} = \frac{E_3}{\tau_{21}}\,\frac{8\pi\nu_0^2k_{th}\tau_{21}\Delta\nu\,n^2}{c^2}$$

or

$$P_{th} = \frac{8\pi\nu_0^2E_3k_{th}\Delta\nu\,n^2}{c^2} \qquad (5.28)$$
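As an illustration of eq. (5.28), the sketch below estimates the threshold pump power per unit volume for the Nd:YAG data of Example 5.3. The pump level energy is taken, purely as an assumption, to be that of a photon of wavelength 0.81 μm, and the threshold gain of 0.5 m$^{-1}$ is likewise hypothetical.

```python
import math

H = 6.626e-34   # Planck constant, J s (rounded)
C = 2.998e8     # speed of light in vacuum, m/s (rounded)

def threshold_pump_power_density(k_th, wavelength, linewidth, n, pump_energy):
    """Threshold pump power per unit volume, eq. (5.28), in W m^-3."""
    nu0 = C / wavelength
    return 8 * math.pi * nu0**2 * pump_energy * k_th * linewidth * n**2 / C**2

e3 = H * C / 0.81e-6   # hypothetical pump level energy, J
p_th = threshold_pump_power_density(0.5, 1.06e-6, 3e12, 1.82, e3)
p_th_broad = threshold_pump_power_density(0.5, 1.06e-6, 6e12, 1.82, e3)
print(f"P_th = {p_th:.3g} W m^-3")
```

Note that the lifetime $\tau_{21}$ has cancelled, and that the threshold pump power is directly proportional to the linewidth: `p_th_broad` is twice `p_th`.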

This is the point at which the gain due to the population inversion exactly equals the cavity
losses. Further increase of the population inversion with pumping is impossible in a steady
state situation, since this would result in a rate of induced energy emission which exceeds
the losses. Thus the total energy stored in the cavity would increase with time in violation
of the steady state assumption (this is the phenomenon of gain saturation described earlier).
This argument suggests that $[N_2 - (g_2/g_1)N_1]$ must remain equal to $N_{th}$ regardless of the
amount by which the threshold pump rate is exceeded. Equation (5.26) shows that this is
possible providing $\rho_\nu B_{21}$ is able to increase (once $\mathcal{R}_2$ exceeds its threshold value given by
eq. 5.27) so that the equality

$$N_{th} = \mathcal{R}_2\left(\frac{1 - A_{21}/A_{10}}{A_{21} + \rho_\nu B_{21}}\right)$$

is satisfied. Now combining this equation with eq. (5.27) we have

$$\rho_\nu = \frac{A_{21}}{B_{21}}\left(\frac{\mathcal{R}_2}{\mathcal{R}_{th}} - 1\right) \qquad (5.29)$$

Since the power output $W$ of the laser will be directly proportional to the optical power
density within the laser cavity and the pump rate into level 2 (i.e. $\mathcal{R}_2$) will be proportional
to the pump power $P$ delivered to the laser, we may rewrite eq. (5.29) as

$$W = W_0\left(\frac{P}{P_{th}} - 1\right) \qquad (5.30)$$

where $W_0$ is a constant.
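The behaviour described by eqs (5.27) and (5.30) can be summarized in two short functions: the inversion rises linearly with pump power up to threshold and is then clamped at $N_{th}$, while the output is zero below threshold and rises linearly above it. This is a sketch in arbitrary units; the function names and the numerical values are ours.

```python
def population_inversion(pump, pump_th, n_th):
    """Inversion: linear in pump power below threshold, clamped at n_th above."""
    return n_th * min(pump / pump_th, 1.0)

def output_power(pump, pump_th, w0):
    """Laser output from eq. (5.30); zero below threshold."""
    return w0 * max(pump / pump_th - 1.0, 0.0)

# Arbitrary units: threshold pump power 1.0, N_th = 1.0, W0 = 1.0
for pump in (0.5, 1.0, 2.0, 3.0):
    print(pump, population_inversion(pump, 1.0, 1.0), output_power(pump, 1.0, 1.0))
```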

Thus if the pump rate is increased above the value $P_{th}$ the beam irradiance is expected to
increase linearly with pump rate. This is borne out in practice and plots of population inver- 
sion and laser output as a function of pump rate are of the form shown in Fig. 5.10. The addi- 
tional power above threshold is channelled into a single (or a few) cavity mode(s) (see section 
5.9). Spontaneous emission still appears above threshold but it is extremely weak in rela- 
tion to the laser output as it is emitted in all directions and has a much greater frequency 
spread. 
FIG. 5.10 Population inversion and laser power output as a function of pump rate.



5.9 Laser modes

Examination of a laser output with a spectrometer of very high resolving power, such as the 
scanning Fabry-Perot interferometer, reveals that it consists of a number of discrete
frequency components (or very narrow spectral lines). To appreciate how these discrete lines
arise and how they are related to the laser transition lineshape we need to examine the effects 
of the mirrors on the light within the laser cavity (see refs 5.8 and 5.2a). 



5.9.1 Axial modes 

The two mirrors of the laser form a resonant cavity and standing wave patterns are set up
between the mirrors in exactly the same way that standing waves develop on a string or within
an organ pipe. The standing waves satisfy the condition

$$p\,\frac{\lambda}{2} = L$$

or

$$\nu = \frac{pc}{2L} \qquad (5.31)$$

where, strictly speaking, $L$ is the optical path length between the mirrors, in which case $\lambda$
would be the vacuum wavelength, and $p$ is an integer, which may be very large (e.g. if
$L = 0.5$ m and $\lambda = 500$ nm then $p = 2 \times 10^6$). As $p$ has such a large value, many different values
of $p$ are possible for only a small change in wavelength. Each value of $p$ satisfying eq. (5.31)
defines an axial (or longitudinal) mode of the cavity.

From eq. (5.31), the frequency separation $\delta\nu$ between adjacent modes ($\delta p = 1$) is given
by

$$\delta\nu = \frac{c}{2L} \qquad (5.32)$$

and therefore for $L = 0.5$ m, $\delta\nu = 300$ MHz. As eq. (5.32) is independent of $p$, the frequency
separation of adjacent modes is the same irrespective of their actual frequencies. The modes
of oscillation of the laser cavity will consist, therefore, of a large number of frequencies,
each given by eq. (5.31) and separated by $c/2L$, as illustrated in Fig. 5.11(b).

It should be appreciated, however, that while all the integers $p$ give possible axial cavity
modes, only those which lie within the gain curve or laser transition line will actually
oscillate. Thus the broadened laser transition (Fig. 5.11a) for the 632.8 nm wavelength emitted
by neon is about $1.5 \times 10^9$ Hz wide so that with the 0.5 m long cavity in the above example
we would expect four or five modes to be present as illustrated in Fig. 5.11(c). Figure 5.12
shows the axial modes of an He-Ne laser about 1 m in length which are displayed with the
aid of an optical frequency analyzer and silicon photodiode.

FIG. 5.11 Broadened laser transition line (a), cavity modes (b), and axial modes in the laser output (c).
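The mode number, the mode spacing of eq. (5.32) and the number of axial modes falling within the transition linewidth can be estimated as follows. This is a sketch with function names of our own; it simply counts the cavity resonances lying within half a linewidth of line centre and takes no account of where the gain actually exceeds the losses.

```python
import math

C = 2.998e8   # speed of light in vacuum, m/s (rounded)

def mode_number(length, wavelength):
    """Number of half wavelengths in the cavity, p = 2L / lambda (eq. 5.31)."""
    return 2.0 * length / wavelength

def mode_spacing(length):
    """Frequency separation of adjacent axial modes, c / 2L (eq. 5.32)."""
    return C / (2.0 * length)

def modes_within_linewidth(length, wavelength, linewidth):
    """Count integers p whose frequency lies within linewidth/2 of line centre."""
    p_centre = mode_number(length, wavelength)        # generally not an integer
    half_range = 0.5 * linewidth / mode_spacing(length)
    return math.floor(p_centre + half_range) - math.ceil(p_centre - half_range) + 1

print(mode_number(0.5, 500e-9), mode_spacing(0.5))
print(modes_within_linewidth(0.5, 632.8e-9, 1.5e9))
```

For the 0.5 m cavity and the 1.5 GHz wide neon transition the count is consistent with the four or five modes quoted above.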

The axial modes all contribute to a single 'spot' of light in the laser output, whereas the 
transverse modes discussed below may give rise to a pattern of spots in the output. If the 
linewidth of the axial modes is measured it will be found to be much narrower than the width 
of the Fabry-Perot resonances to be expected from treating the cavity simply as a 
Fabry-Perot interferometer (see ref. 5.9). We can appreciate the reason for this by
considering the quality factor $Q$ of the resonator. $Q$ can be defined in general by

$$Q = \frac{2\pi \times \text{energy stored in the resonator}}{\text{energy dissipated per cycle}}$$

or

$$Q = \frac{\text{resonant frequency}}{\text{linewidth}} = \frac{\nu}{\Delta\nu}$$



For an electrical oscillator, Q may be approximately 100. However, for a laser, Q may be 
about 10" and hence Av - 1 MHz, which is much narrower than the Fabry-Perot resonances, 
which are about 10 4 Hz. Indeed, in lasers the active medium is actually supplying energy
to the oscillating modes so that in theory the energy dissipation can be zero and $Q$ infinite.
In practice, there are always losses which prevent this happening, but even so linewidths of
about 1 Hz have been achieved.

FIG. 5.12 Axial modes formed in the He-Ne laser. The mode pattern is repeated (here, three times) as
the optical frequency analyzer scans through the gain curve of the laser. (Photograph courtesy of Dr I. D.
Latimer, School of Engineering, University of Northumbria at Newcastle.)

5.9.2 Transverse modes 

Axial modes are formed by plane waves travelling axially along the laser cavity on a line 
joining the centres of the mirrors. For any real laser cavity there will probably be waves trav- 
elling just off axis that are able to replicate themselves after covering a closed path such as 
that shown in Fig. 5.13(a). These will also give rise to resonant modes, but because they have 
components of their electromagnetic fields which are transverse to the direction of propa- 
gation they are termed transverse electromagnetic (or TEM) modes. A complete analysis 
of TEM modes is quite complicated and will not be attempted here (but see Chapter 8 for 
a further discussion). They are characterized by two integers $q$ and $r$ so that, as Fig. 5.13(b)
shows, we have TEM$_{00}$, TEM$_{01}$, TEM$_{11}$, etc., modes ($q$ gives the number of minima as the
beam is scanned horizontally and $r$ the number of minima as it is scanned vertically).

In a TEM$_{00}$ mode, the irradiance distribution across the beam is in fact Gaussian, and so
we may write the electric field variation as

$$\mathcal{E}(x, y) = \mathcal{E}_0\exp\left(-\frac{x^2 + y^2}{w^2}\right) \qquad (5.33)$$

where $x$ and $y$ are measured in directions perpendicular to the laser axis which is taken to be
along the $z$ direction. The sideways spread of the beam is determined by the value of the
parameter $w$, which is a function of the distance $z$. When $x^2 + y^2 \gg w^2$, the field falls off rapidly with
distance away from the laser axis. The value of $w$ is determined by the locus of points where
the field amplitude has fallen to $\exp(-1)$ of its maximum value (i.e. where $x^2 + y^2 = w^2$). Figure
5.14 shows the typical variation of $w$ with position, within a cavity formed by two concave
mirrors of radius of curvature $r_1$ and $r_2$ separated by $L$. Such a cavity can be shown to be stable
when $L < r_1 + r_2$. The surfaces of constant phase are not in general plane, but are
perpendicular to the contours of constant field strength. It can be seen that the mirrors themselves are
surfaces of constant phase. This is no accident, but merely a direct consequence of the requirement
that the mode be self-replicating as the light energy flows backwards and forwards between
the mirrors. At one position within the cavity, the wavefronts become plane and, in fact, a plane
mirror placed at this point would give rise to a hemispherical cavity. At this point also $w$ has
its smallest value, $w_0$. The variation of $w$ with $z$ is given by (ref. 5.10)

$$w(z) = w_0\left[1 + \left(\frac{\lambda z}{\pi w_0^2}\right)^2\right]^{1/2} \qquad (5.34)$$

where $z$ is measured from the position of minimum beam diameter. The precise value of $w_0$
depends on the type of cavity. In the case of a nearly confocal cavity where $r_1 = r_2 = r \approx L$ it
can be shown that (ref. 5.5)

$$w_0^2 = \frac{\lambda}{2\pi}[L(2r - L)]^{1/2} \quad\text{or}\quad w_0^2 \approx \frac{\lambda r}{2\pi} \qquad (5.35)$$
FIG. 5.13 (a) Example of a non-axial self-replicating ray that gives rise to transverse modes. (b) Some low
order transverse modes of a laser. The modes are designated TEM$_{qr}$, where $q$ and $r$ are integers referring to
the number of minima as the laser beam is scanned horizontally and vertically. The TEM$^*_{01}$ mode is a
combination of TEM$_{01}$ and TEM$_{10}$ modes. (From M. J. Beesley, Lasers and their Applications, 1972; courtesy of
Taylor and Francis Ltd.)



Combining eqs (5.34) and (5.35) yields

$$w(z) = w_0\left[1 + \left(\frac{2z}{r}\right)^2\right]^{1/2} \qquad (5.36)$$

Equation (5.36) applies outside as well as inside the cavity, and at large distances from the
laser, that is when $z \gg r$, we have

$$w(z) \approx \frac{2w_0z}{r} = \frac{\lambda z}{\pi w_0} \qquad (5.37)$$

FIG. 5.14 TEM$_{00}$ mode within a laser cavity, showing the constant phase surfaces. The mode 'adjusts' itself so that the mirror surfaces are
surfaces of constant phase. The value of $w$ is determined by the locus of points where the field amplitude
has fallen to $\exp(-1)$ of its maximum value.
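Equations (5.34), (5.35) and (5.37) can be evaluated for a representative case. The sketch below takes a confocal He-Ne cavity with $r = L = 0.5$ m, values which are hypothetical and chosen only to match the earlier examples; the function names are our own.

```python
import math

def waist_symmetric_cavity(wavelength, r, length):
    """Minimum spot size w0 from eq. (5.35) for mirrors of equal radius r."""
    return math.sqrt(wavelength / (2 * math.pi) * math.sqrt(length * (2 * r - length)))

def spot_size(z, w0, wavelength):
    """Spot size w(z) from eq. (5.34); z is measured from the beam waist."""
    return w0 * math.sqrt(1.0 + (wavelength * z / (math.pi * w0**2)) ** 2)

def far_field_spot_size(z, w0, wavelength):
    """Large-distance limit of w(z), eq. (5.37)."""
    return wavelength * z / (math.pi * w0)

wavelength = 632.8e-9
w0 = waist_symmetric_cavity(wavelength, 0.5, 0.5)
print(f"w0 = {w0 * 1e3:.3f} mm")
print(f"w(100 m) = {spot_size(100.0, w0, wavelength) * 1e3:.1f} mm")
```

The waist is a fraction of a millimetre, and at 100 m the exact and far-field expressions are indistinguishable, the beam having spread to a radius of several centimetres.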



As the TEM$_{01}$ and higher order modes extend further from the cavity axis than the TEM$_{00}$
mode, they will only oscillate if the aperture of the cavity is large enough. These and higher
order modes can, therefore, be eliminated by narrowing the laser cavity leaving just the
TEM$_{00}$ mode oscillating. The TEM$_{00}$ mode is often called the uniphase mode, since all parts
of the propagating wavefront are in phase; this is not the case with higher order modes and
in fact phase reversals account for the higher order transverse mode patterns. Consequently,
a laser operating only in the TEM$_{00}$ mode has the greatest spectral purity and degree of
coherence, while operation in multimode form provides considerably more power.

It should be noted that each transverse mode will have the axial modes discussed above 
associated with it, so that the total spread in the laser spectrum may be (relatively) large. 



5.10 Classes of laser

In the 40 years since Maiman reported the first observation of successful laser action in ruby, 
there has been an extremely rapid increase in the types of lasers and in the range of mate- 
rials in which lasing has been shown to occur. It is not possible to describe all of these devel- 
opments, so in this section we have concentrated on a description of the construction and 
mode of operation of some of the more commonly available and important lasers. These are 
classified into four groups: doped insulator, semiconductor, gas and dye lasers. 
Before discussing these laser types it might be useful to remind ourselves of the basic
requirements which must be satisfied for laser operation. 

First, there must be an active medium which emits radiation in the required region of the 
electromagnetic spectrum. Secondly, a population inversion must be created within the 
medium; this, in turn, requires the existence of suitable energy levels associated with the 
lasing transition for pumping. Thirdly, for true laser oscillation there must be optical feed- 
back at the ends of the medium to form a resonant cavity (satisfying the first two conditions 
can provide light amplification but not the highly collimated, monochromatic beam of light 
which makes lasers so useful). 

5.10.1 Doped insulator lasers 

The term doped insulator laser is used to describe a laser whose active medium consists of
a crystalline or amorphous (glassy) host material containing active ions, typically from the
transition metal and rare earth groups of elements in the periodic table. These ions are
impurities which are intentionally introduced (i.e. doped) either into the crystal during its growth,
or to the melt from which the glass solidifies. In addition to the wide range of ions
mentioned above there are several suitable host materials available, such as sapphire (Al$_2$O$_3$),
garnets, aluminates and fluorides, thereby providing a large range of lasing wavelengths,
together with associated pumping transitions.

The principal characteristics of suitable host crystals, normally in the form of a rod, rec- 
tangular bar or fiber include: 

• high degree of optical uniformity and freedom from defects which act as scattering centres;

• low expansivity and high thermal conductivity to reduce thermal stress, and to maintain
a uniform refractive index when heated during the pumping process;

• easy crystal growth with lattice sites which readily accept the impurity ions at high doping
levels, up to approximately $10^{26}$ m$^{-3}$.

Glasses are much easier to fabricate with uniform composition and optical properties, and 
can be easily made into a variety of different shapes. They can also normally be doped to 
an even higher concentration than the crystalline hosts. 

Doped insulator lasers are rugged, easy to maintain and capable of generating high peak 
powers. They are invariably optically pumped and the availability of suitable pumping 
sources of light is an important consideration. Typical examples are the ruby, Nd:YAG,
alexandrite, YLF and silicate glass lasers. Although the ruby laser is interesting in that it 
was the first successful laser, the Nd:YAG laser is now much more widely used and we 
describe it in some detail, giving only passing reference to the ruby and other lasers. 

Nd:YAG LASER

The active medium for this laser is yttrium aluminium garnet (Y$_3$Al$_5$O$_{12}$) with the rare earth
metal ion neodymium Nd$^{3+}$ present as an impurity. The Nd$^{3+}$ ions, which are randomly
distributed as substitutional impurities on lattice sites normally occupied by the yttrium ions,
provide the energy levels for both the lasing transitions and pumping. Though the YAG host
itself does not participate directly in the lasing action, it does have two important roles.
When an Nd$^{3+}$ ion is placed in a host crystal lattice it is subject to the electrostatic field
of the surrounding ions, the so-called crystal field. The crystal field of the host interacts with
the electron energy levels in a variety of ways depending on such factors as its strength and
symmetry, and on the electron configuration of the impurity.

A neodymium ion which is free to move in a gaseous discharge, for example, has many
of its energy levels with the same energy; these are said to be degenerate. When the ion is
placed in the host the crystal field splits some of the energy levels, thereby partly
removing the degeneracy. A rather more important effect in the case of the Nd:YAG laser is that
the crystal field modifies the transition probabilities between the various energy levels of
the Nd$^{3+}$ ion so that some transitions, which are forbidden in the free ion, become allowed.

The net result is that the ground and first excited state energy levels of the Nd$^{3+}$ ion split
into the groups of levels shown in Fig. 5.15. The symbols used to describe the energy levels
in this and succeeding diagrams depend on the exact nature of the ions and atoms involved.
In the case under discussion, the symbols arise from Russell-Saunders or LS coupling (ref.
5.11). The symbol for an energy level is written $^{2S+1}X_J$. Here $S$ is the vector sum of the
electron spins of the ion. $X$ gives the vector sum $L$ of the orbital angular momentum quantum
numbers, where values of $L = 0, 1, 2, 3, 4, \ldots$ are designated by S, P, D, F, G, .... Finally, $J$ is
the vector sum of $S$ and $L$ (see ref. 5.11). Thus in the $^4F_{3/2}$ level, $S = 3/2$ (there are three
unpaired electrons in the 4f subshell of the Nd$^{3+}$ ion), $L = 3$ and $J = 3/2$. Whilst familiarity
with this and similar notations for electron energy levels is not essential to appreciate the
basics of a given laser, it is a prerequisite for a detailed understanding of the mechanisms
involved.

FIG. 5.15 Simplified energy level diagram for the neodymium ion in YAG showing the principal laser
transitions. Laser emission also results from transitions between the $^4F_{3/2}$ levels and two other $^4I$
levels, but at only one-tenth of the intensity of the transition shown.
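The Russell-Saunders notation can be decoded mechanically, which may help in reading energy level diagrams such as Fig. 5.15. The sketch below is our own illustration: it converts the multiplicity, letter and subscript of a term symbol into $S$, $L$ and $J$, and checks that $J$ lies between $|L - S|$ and $L + S$. The letter sequence beyond G (H, I, K, with J omitted) is the usual spectroscopic convention and is not given in the text.

```python
from fractions import Fraction

# L = 0, 1, 2, ... ; the letter J is conventionally skipped
L_LETTERS = "SPDFGHIK"

def parse_term(multiplicity, letter, j):
    """Return (S, L, J) for a term symbol such as 4 F 3/2."""
    s = Fraction(multiplicity - 1, 2)     # multiplicity is 2S + 1
    l = L_LETTERS.index(letter)
    j = Fraction(j)
    if not abs(l - s) <= j <= l + s:
        raise ValueError("J must lie between |L - S| and L + S")
    return s, l, j

print(parse_term(4, "F", "3/2"))    # upper laser level of Nd:YAG
print(parse_term(4, "I", "11/2"))   # terminal laser level of Nd:YAG
```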

As we can see from Fig. 5.15 the Nd:YAG laser is essentially a four-level system, that is
the terminal laser level $^4I_{11/2}$ is sufficiently far removed from the ground state $^4I_{9/2}$ that its
room temperature population is very small. Whilst a number of laser transitions may occur
between some of the pairs of levels shown to the right of the figure, the most intense line
at 1.064 μm arises from the superposition of the two transitions shown.

Pumping is normally achieved by using an intense flash of white light from a xenon
flashtube. This excites the Nd$^{3+}$ ions from the ground state to the various energy states above the
$^4F_{3/2}$ state; there are, in fact, many more states at higher energies than are shown in Fig. 5.15.
The presence of several possible pumping transitions contributes to the efficiency of the laser 
when using a pumping source with a broad spectral output. To ensure that as much radia- 
tion as possible from the flashtube is absorbed in the laser medium, close optical coupling 
is required. The usual arrangement is shown in Fig. 5.16; a linear flashtube and the lasing 
medium in the form of a rod are placed inside a highly reflecting elliptical cavity. If the flash- 
tube is along one focal axis and the laser rod along the other, then the properties of the ellipse 
ensure that most of the radiation from the flashtube passes through the laser. The flashtube 
is fired by discharging a capacitor bank through the tube; the discharge is often initiated by 
using a secondary high voltage (~ 20 kV) trigger pulse. 

As the pumping flash lasts for only a short time (≈ 1 ms) the laser output is in the form
of a pulse, which starts about 0.5 ms after the pumping flash starts. This represents the time
for the population inversion to build up. Once started, stimulated emission builds up rapidly
and thus depopulates the upper lasing level 2, much faster in fact than the pumping can
replace the excited atoms, so that laser action momentarily stops until population inversion
is achieved again. This process then repeats itself so that the output consists of a large number
of spikes of about 1 μs duration with about 1 μs separation. As the spikes are unrelated, the
coherence of the laser pulse (which lasts a total of about 0.5 ms), while being much greater
than that of classical light sources, is not as great as might be expected.

FIG. 5.16 Typical construction of a doped insulator laser showing the ellipsoidal reflector used to
maximize optical coupling between the flashtube and laser rod.

The optical cavity may be formed by grinding the ends of the Nd:YAG rod flat and par- 
allel and then silvering them. More usually, however, external mirrors are used as shown 
in Fig. 5.16. One mirror is made totally reflecting while the other is about 10% transmitting 
to give an output. 

A large amount of heat is dissipated by the flashtube and consequently the laser rod quickly 
becomes very hot. To avoid damage resulting from this, and to allow a reasonable pulse re- 
petition rate, cooling has to be provided by forcing air over the crystal using the reflector 
as a container. For higher power lasers it is necessary to use water cooling. Provided suffi- 
cient cooling is available it is possible to replace the xenon flashtube by a krypton one or a 
quartz-halogen lamp and to operate the laser continuously. 

A glance at the energy level diagram, Fig. 5.15, shows that the maximum possible power
efficiency of the laser, $\nu_{21}/\nu_{03}$, is about 80%. In practice, because of losses in the system (which
include the loss in converting electrical to optical energy in the pumping source, the poor
coupling of the pumping source output to the laser rod and the small fraction of the pumping
radiation which is actually absorbed), the actual power efficiency is typically 0.1%. Thus, a laser
pumped by a flash lamp operated by the discharge of a 1000 μF capacitor charged to 4-5 kV
(i.e. an input energy of about 10 kJ) may produce an output pulse of about 10 J. As the pulse
lasts for only about 0.5 ms, the average power, however, is then about $2 \times 10^4$ W. The peak
power may be greatly increased by the technique of Q-switching, which is described in section
6.4.
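The energy budget quoted above follows from the energy stored in the capacitor, $\frac{1}{2}CV^2$. The short sketch below reproduces the figures, taking a charging voltage of 4.5 kV as a representative value within the 4-5 kV range quoted; the function names are our own.

```python
def capacitor_energy(capacitance, voltage):
    """Energy stored in a charged capacitor, (1/2) C V^2, in joules."""
    return 0.5 * capacitance * voltage**2

def pulse_output(input_energy, efficiency, pulse_duration):
    """Return (output energy in J, average power during the pulse in W)."""
    energy_out = input_energy * efficiency
    return energy_out, energy_out / pulse_duration

input_energy = capacitor_energy(1000e-6, 4.5e3)          # about 10 kJ
energy_out, average_power = pulse_output(input_energy, 0.001, 0.5e-3)
print(input_energy, energy_out, average_power)
```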

Nd:GLASS LASER

Both silicate and phosphate glasses have high optical homogeneity and provide excellent
host materials for neodymium. Local electric fields within the glass modify the Nd$^{3+}$ ion
energy levels in much the same way as the crystal field in YAG. The Nd$^{3+}$ ions, however,
may be in a variety of slightly different environments causing the linewidth to be much
broader than in YAG by a factor of about 50, thereby raising the threshold pumping power
required for laser action (eq. 5.28 showed that the threshold pump power is proportional to
$\Delta\nu$). In consequence, Nd:glass lasers are operated in the pulsed mode and the output
spectral linewidth is greater than in Nd:YAG. On the other hand, glass can be doped more heavily
than YAG (6% as opposed to 1.5%) and up to three times as much energy can therefore be 
produced by Nd:glass lasers; it is also much easier and cheaper to prepare glass rods than 
to grow YAG crystals. 

RUBY LASER 

The basic principle of operation of the ruby laser is the same as that of the Nd:YAG laser. The 
active medium is a synthetically grown crystal of ruby, that is aluminium oxide, with about 
0.05% by weight of chromium as an impurity. Chromium ions, Cr³⁺, replace aluminium ions
in the lattice and the crystal field partially removes the degeneracy of the isolated ions to provide
levels for pumping and for the laser transitions. In this case, some of the energy levels of the
Cr³⁺ ions are almost independent of the crystal field and they remain sharp. Other levels,
however, are strongly dependent on the crystal field so that lattice vibrations, which cause
fluctuations in the crystal field, broaden these levels quite substantially. The ²E and ⁴A₂ levels
remain sharp while the ⁴T₁ and ⁴T₂ levels become broad as shown in Fig. 5.17 (the symbols in this
case are derived from crystal field theory, ref. 5.12). Thus the pump transitions are spectrally
broad while the transitions R₁ and R₂ are narrow. The energy level diagram shows that ruby
is basically a three-level system. As explained previously, rather more than half of the total
number of ions have to be pumped to level 2 via level 3 to create a population inversion. Thus
the laser has a very low efficiency compared with a four-level system such as Nd:YAG.
Pumping is achieved through the absorption of the green and blue spectral regions of a white 
light discharge; this absorption, of course, accounts for the colour of ruby. 

DIODE-PUMPED LASERS 

It was mentioned earlier in this section that, because of poor optical coupling and a poor
match of the pumping source emission spectrum to the absorption bands of the Nd ion,
the overall efficiency of Nd:YAG and glass lasers, despite having four-level energy level
systems, is less than 1%. Significant improvement of the overall efficiency can be achieved
by pumping the Nd:YAG with semiconductor diode lasers, which as we shall see in section
5.10.2 are themselves very efficient. Although we have not yet discussed such lasers, for
the present purpose we can regard them as sources of nearly monochromatic radiation whose
emission wavelength can be tuned by, for example, changing their temperature.
GaAs/GaAlAs lasers can emit at a wavelength of 808 nm which coincides with a very strong
absorption peak in Nd:YAG, so that very efficient pumping can be achieved. This readily
permits CW operation without the need for water cooling, with the added benefit of a
substantial reduction in the overall size of the laser and much improved frequency stability (see
section 6.1).

FIG. 5.17 Three-level ruby laser system (energy level diagram, energy in eV; laser lines R₁ at
694.3 nm and R₂ at 692.7 nm). Pumping is due to the Cr³⁺ ions absorbing blue (excitation to ⁴T₁
levels) and green light (excitation to ⁴T₂ levels). The wavelengths of the R₁ and R₂ laser lines are
temperature dependent; the values given are typical. (Some energy levels not directly involved in the laser
transition have been omitted for clarity.)

Pumping is usually carried out longitudinally, that is the light from the laser diodes enters 
through one or both ends of the YAG crystal as shown in Fig. 5.18. The cavity end mirrors 
have very high reflectances at the laser wavelength of 1.06 µm, but are transparent at the
pumping wavelength of 808 nm. Even more compact devices can be made in which dielec- 
tric reflecting coatings are applied to the ends of the laser rod. 
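
The ceiling on power efficiency quoted earlier for lamp pumping (the ratio of the laser photon frequency to the pump photon frequency) has a direct analogue for diode pumping. The sketch below, which is an illustration rather than a calculation taken from the text, forms the same ratio from the two wavelengths given above; the function name is hypothetical and every other loss (coupling, absorption, cavity losses) is ignored.

```python
def photon_energy_ratio(pump_wavelength_nm, laser_wavelength_nm):
    """Fraction of each pump photon's energy carried by one laser photon.

    Photon energy is inversely proportional to wavelength, so the ratio
    of laser to pump photon energy is pump_wavelength / laser_wavelength.
    """
    return pump_wavelength_nm / laser_wavelength_nm


# Wavelengths quoted in the text: 808 nm pump, 1.06 um (1060 nm) laser line.
ratio = photon_energy_ratio(808.0, 1060.0)
print(f"upper bound on optical-to-optical efficiency: {100 * ratio:.0f} %")
```

This is only an upper bound set by the photon energies; practical diode-pumped lasers fall below it.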

With the advent in recent years of high power laser diode bars and arrays (see section 
5.10.2) comprising several devices in a linear or two-dimensional array operating simulta- 
neously, CW outputs of 20 W or more can be achieved from diode-pumped Nd:YAG lasers. 
In these cases the light from the diode arrays can be efficiently collected using optical fibers 
which transform the elliptical cross-section of the output beam of laser diodes into a circu- 
lar beam which is suitable for end pumping. In this way some 65% of the output of the diode 
array can be coupled into the laser crystal. Alternatively the laser crystal can be pumped trans- 
versely using many diode arrays arranged symmetrically around the laser rod (ref. 5.13). 

5.10.1.1 Glass fiber laser 

An interesting extension of the idea of an end-pumped solid rod laser is the fiber laser (ref. 
5.14). These use optical fiber instead of solid rods. Optical fibers will be dealt with in detail 
in Chapter 8; however, here we may simply regard them as being small diameter 'flexible' 
rods consisting of a narrow diameter 'core' region surrounded by a somewhat thicker 
'cladding'. Light may readily propagate down the core region of such fibers with minimal 
sideways loss, even when the fiber is bent. The core of the fibers, which are usually made
from silica (SiO₂), may be doped with Nd³⁺ to create a lasing medium. A laser cavity may
be formed by butting dielectric mirrors up against the ends of the fiber as shown in Fig. 5.19.

FIG. 5.18 Diagram of a longitudinal or end-pumped, diode-pumped, solid state laser - here Nd:YAG.

FIG. 5.19 End-pumped fiber laser.

FIG. 5.20 Cladding-pumped fiber laser - the pumping radiation is effectively captured by the outer core
and coupled into the inner core.

The so-called cladding pumping technique (Fig. 5.20) helps to overcome the difficulty of 
efficiently coupling the laser light into the extremely narrow fiber core, by using an outer, 
undoped, core of much larger area to collect the pump light. Nd-doped fiber lasers of this 
type may emit 5 W or more in a single transverse mode. It should perhaps be noted that in 
relation to the fibers described in Chapter 8 cladding pumping is a misnomer as in reality 
the light is collected by an outer core and not the cladding. Alternatively the pumping light, 
at a wavelength of 810 nm, may be introduced into the fiber core using one of the techniques 
described in section 8.5. While there are several applications for fiber lasers their most sig- 
nificant contribution to date has been in fiber optic communications, where fiber amplifiers, 
rather than oscillators, are beginning to make a major contribution (see section 9.3). 



5.10.1.2 Vibronic lasers

Vibronic lasers such as alexandrite and Ti:sapphire are superficially similar to other solid 
state lasers such as Nd:YAG in that light from an external pump source excites impurity ions 
in a transparent host. They are fundamentally different, however, in that laser gain is pos- 
sible over a broad range of wavelengths so that they can produce either tunable outputs or 
ultrashort pulses (see Chapter 6). The reason for this is that the electronic energy levels
involved in the laser transition in vibronic lasers are broadened into bands of vibrational
sublevels, corresponding to discrete quantized lattice vibration energies. Thus transitions can
take place over a range of energy differences between the upper and lower levels giving a 
corresponding range of wavelengths. This behaviour is rather like that of the dye lasers dis- 
cussed in section 5.10.4. 

Most vibronic solid state lasers have four-level systems that operate in much the same 
way (ref. 5.15). The pumping radiation excites the active ions to a vibronic band. The ions
then lose vibrational energy and drop to the bottom of the band, which is the upper laser
level. The laser transition then occurs to a vibrationally excited sublevel of the ground
electronic state; this is followed by the ions relaxing to the lowest sublevel of the ground state
by releasing vibrational energy as illustrated in Fig. 5.21, which also shows a fixed
wavelength transition.

Many vibronic lasers use chromium ions as the active ingredient and it is instructive to
contrast the ruby laser with the alexandrite laser, which comprises Cr³⁺ ions in a beryllium
aluminate (BeAl₂O₄) host. In the case of ruby, as we saw earlier, the energy levels are discrete
and we have a three-level system. In contrast Fig. 5.21 shows that the ⁴T₂ and ⁴A₂ levels are
broadened and that in effect we have a four-level system. Pumping at wavelengths between 380 nm
and 630 nm excites the Cr³⁺ ions. Laser action then occurs from the bottom of the ⁴T₂
band to one of the sublevels in the ⁴A₂ band, giving rise to emission wavelengths in the range
701-826 nm.

FIG. 5.21 Energy level diagram for Cr³⁺ ions in alexandrite (BeAl₂O₄) - some energy levels not directly
involved in the laser transitions have been omitted for clarity.

In the absence of any wavelength-selective element in the optical system the laser should 
oscillate at the peak of the gain curve. Tuning to a particular wavelength is often
accomplished by incorporating a birefringent filter within the optical cavity: the filter is rotated
to pass a very narrow range of wavelengths within the tuning range.

5.10.2 Semiconductor lasers 

Semiconductor lasers are not very different in principle from the light-emitting diodes
discussed in Chapter 4. A p-n junction provides the active medium: thus, to obtain laser action
we need only meet the other necessary requirements of population inversion and optical
feedback. To obtain stimulated emission, there must be a region of the device where there are
many excited electrons and vacant states (i.e. holes) present together. This is achieved by
forward biasing a junction formed from very heavily doped n and p materials. In such n⁺-type
material, the Fermi level lies within the conduction band. Similarly, for the p⁺-type
material the Fermi level lies in the valence band. The equilibrium and forward-biased energy band
diagrams for a junction formed from such so-called degenerate materials are shown in Fig.
5.22. When the junction is forward biased with a voltage that is nearly equal to the energy
gap voltage E_g/e, electrons and holes are injected across the junction in sufficient numbers
to create a population inversion in a narrow zone called the active region (Fig. 5.23).

FIG. 5.22 Heavily doped p-n junction: (a) in equilibrium and (b) with forward bias (the dashed lines
represent the Fermi level in equilibrium (a) and with forward bias (b)).

FIG. 5.23 Diagram showing the active region and mode volume of a semiconducting laser.

The thickness t of the active region can be approximated by the diffusion length L_e of the
electrons injected into the p region, assuming that the doping level of the p region is less
than that of the n region so that the junction current is carried substantially by electrons
(sections 2.7.1 and 2.8.2). For heavily doped GaAs at room temperature L_e is 1-3 µm.

In the case of those materials such as GaAs which have a direct bandgap (section 2.4) the 
electrons and holes have a high probability of recombining radiatively. The recombination 
radiation produced may interact with valence electrons and be absorbed, or interact with
electrons in the conduction band thereby stimulating the production of further photons of the
same frequency (ν = E_g/h). If the injected carrier concentration becomes large enough, the
stimulated emission can exceed the absorption so that optical gain can be achieved in the 
active region. Laser oscillations occur, as usual, when the round trip gain exceeds the total 
losses over the same distance. In semiconductors, the principal losses are due to scattering 
at optical inhomogeneities in the semiconductor material and free carrier absorption. The 
latter results when electrons and holes absorb a photon and move to higher energy states in 
the conduction band or valence band respectively. The carriers then return to lower energy 
states by non-radiative processes. 

In the case of diode lasers, it is not necessary to use external mirrors to provide positive 
feedback. The high refractive index of the semiconductor material ensures that the reflectance 
at the material/air interface is sufficiently large even though it is only about 0.32. 

EXAMPLE 5.4 Reflectance at a GaAs/air interface

We may confirm that the reflectance at a GaAs surface is quite high by virtue of the
high refractive index (≈3.6) of GaAs. From the Fresnel equations (section 8.1) we
have

$$R = \left(\frac{n_2 - n_1}{n_2 + n_1}\right)^2 = \left(\frac{3.6 - 1}{3.6 + 1}\right)^2 = 0.32$$

The diode is cleaved along natural crystal planes normal to the plane of the junction so
that the end faces are parallel; no further treatment of the cleaved faces is usually necessary,
although occasionally optical coatings are added for various purposes. For GaAs, the
junction plane is (100) and the cleaved faces are (110) planes.
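
Since the cleaved faces act as the cavity mirrors, their reflectance follows directly from the normal-incidence Fresnel formula used in Example 5.4. A minimal sketch, with a hypothetical function name and assuming air (n = 1) outside the crystal:

```python
def normal_incidence_reflectance(n_outside, n_inside):
    """Fresnel power reflectance at normal incidence between two dielectrics."""
    return ((n_inside - n_outside) / (n_inside + n_outside)) ** 2


# GaAs (n of about 3.6, from the text) against air (n = 1, assumed).
facet_reflectance = normal_incidence_reflectance(1.0, 3.6)
print(f"R = {facet_reflectance:.2f}")
```

The result depends only on the two refractive indices, which is why no coating is needed to obtain useful feedback from a cleaved facet.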

The radiation generated within the active region spreads out into the surrounding lossy
GaAs, although there is, in fact, some confinement of the radiation within a region called
the mode volume (Fig. 5.23). The additional carriers present in the active region increase
its refractive index above that of the surrounding material, thereby forming a dielectric
waveguide (see Chapter 8). As the difference in refractive index between the centre waveguiding
layer and the neighbouring regions is only about 0.02, the waveguiding is very inefficient
and the radiation extends some way beyond the active region, thereby forming the mode
volume. The waveguiding achieved in simple homojunction laser diodes of the form shown
in Fig. 5.24 only works just well enough to allow laser action to occur as a result of very
vigorous pumping. Indeed homojunction lasers can usually only be operated in the pulsed
mode at room temperature because the threshold pumping current density (see below)
required is so high, being typically of the order of 400 A mm⁻².

FIG. 5.24 Schematic construction of a GaAs homojunction semiconductor diode laser having side lengths
200-400 µm (a). The emission is confined to the junction region. The narrow thickness d of this region
causes a large beam divergence. The very small change in refractive index in the junction region is shown
in (b) and (c) shows the resulting poor confinement of the optical radiation to the gain region.

The onset of laser action at the threshold current density is detected by an abrupt increase in 
the radiance of the emitting region, as shown in Fig. 5.25, which is accompanied by a dramatic 
narrowing of the spectral width of emission. This is illustrated very clearly in Fig. 5.26 which 
shows the mode structure below, and at threshold, where the energy has been channelled into 
a relatively small number of modes. If the current is increased substantially above threshold one 
mode usually predominates, with a further decrease in the spectral width of the emission. 

5.10.2.1 Threshold current density for semiconductor lasers

An exact calculation of the threshold current for a semiconductor laser is complicated by 
the difficulty of defining what is meant by a population inversion between two bands of 
energy levels. To simplify the problem, however, and to gain some insight into the impor- 
tant factors, we use the idealized structure shown in Fig. 5.23. We let the active volume, 
where population inversion is maintained, have thickness t and the mode volume, where 
the generated electromagnetic mode is confined, be of thickness d (d > t). In other lasers,
the mode volume is usually smaller than the volume within which population inversion
is maintained.

FIG. 5.25 Light output-current characteristic of an ideal semiconductor laser.

FIG. 5.26 Emission spectrum of a GaAlAs laser diode both just below (a) and just above (b) threshold.
Below threshold a large number of Fabry-Perot cavity resonances can be seen extending across a wide
LED-type spectrum. Above threshold only a few modes close to the peak of the gain curve oscillate. For the
particular laser shown here the threshold current was 37 mA while spectra (a) and (b) were taken with
currents of 35 mA and 39 mA, respectively.



A consequence of the situation in semiconductor lasers is that the portions of the mode 
propagating outside the active region may be absorbed. This offsets to some extent the gain 
resulting from those parts of the mode propagating within the active region. We allow for 
this by assuming that the effective population inversion within the mode volume (d × l × w)
is given by reducing the actual population inversion in the active region by the factor t/d. 

Referring to eq. (5.23), the threshold condition will thus be reached when

$$\frac{t}{d}\,(N_2 - N_1)_{\mathrm{th}} = \frac{8\pi\nu_0^2 k_{\mathrm{th}}\tau_{21}\Delta\nu\, n^2}{c^2}$$

We next assume that within the active region we can ignore N₁, that is there is a large number
of holes in the valence band; hence,

$$(N_2)_{\mathrm{th}} = \frac{d}{t}\,\frac{8\pi\nu_0^2 k_{\mathrm{th}}\tau_{21}\Delta\nu\, n^2}{c^2} \qquad (5.38)$$

If the current density flowing through the laser diode is J A m⁻², then the number of
electrons per second being injected into a volume t (i.e. a region of thickness t and of unit
cross-sectional area) of the active region is J/e. Thus the number density of electrons being injected
per second is J/(et) electrons s⁻¹ m⁻³. For an equilibrium number density N₂ of electrons in the
conduction band the recombination rate is N₂/τ_e, where τ_e is the electron lifetime (τ_e is not
necessarily equal to τ₂₁, the spontaneous lifetime, since some non-radiative recombination
mechanisms are likely to be present); in equilibrium this recombination rate equals the injection rate.
The threshold current density is then given by

$$\frac{(J)_{\mathrm{th}}}{et} = \frac{(N_2)_{\mathrm{th}}}{\tau_e}$$

and substituting from eq. (5.38) we have

$$(J)_{\mathrm{th}} = \frac{et}{\tau_e}\,\frac{d}{t}\left(\frac{8\pi\nu_0^2 k_{\mathrm{th}}\tau_{21}\Delta\nu\, n^2}{c^2}\right)$$

Substituting for k_th from eq. (5.17) then gives

$$(J)_{\mathrm{th}} = \frac{8\pi e d\,\nu_0^2\,\Delta\nu\, n^2\,\tau_{21}}{c^2\,\tau_e}\left[\gamma + \frac{1}{2l}\ln\!\left(\frac{1}{R_1R_2}\right)\right] \qquad (5.39)$$

The ratio τ_e/τ₂₁ in this equation is often written as η_i, the internal quantum efficiency, which
is the fraction of the injected electrons (or holes) which recombine radiatively.

EXAMPLE 5.5 Threshold current density in a GaAs laser

We may use the following data to estimate the threshold current density of a GaAs
junction laser: wavelength, λ = 0.84 µm; transition linewidth, Δν = 1.45 × 10¹³ Hz; loss
coefficient, γ = 3.5 × 10³ m⁻¹; refractive index, n = 3.6; dimensions, l = 300 µm,
d = 2 µm; and internal quantum efficiency, η_i ≈ 1.

Taking n = 3.6 gives R = 0.32. The threshold gain is given by eq. (5.17) as

$$k_{\mathrm{th}} = \gamma + \frac{1}{2l}\ln\!\left(\frac{1}{R_1R_2}\right)$$

Therefore

$$k_{\mathrm{th}} = 3500 + \frac{1}{6\times10^{-4}}\ln\!\left(\frac{1}{0.32^2}\right)$$

that is

$$k_{\mathrm{th}} = 7298\ \mathrm{m^{-1}}$$

Hence, from eq. (5.39),

$$(J)_{\mathrm{th}} = 15.5\times10^{6}\ \mathrm{A\,m^{-2}} = 15.5\ \mathrm{A\,mm^{-2}}$$

This value is in reasonable agreement with those measured at low temperature in
GaAs lasers.
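
The estimate in Example 5.5 can be reproduced numerically. The sketch below assumes the threshold expression takes the form J_th = 8π e d Δν n² k_th / (η_i λ₀²), which is eq. (5.39) with ν₀/c written as 1/λ₀; the function names and the rounded value of the electron charge are illustrative choices, not taken from the text.

```python
import math

ELECTRON_CHARGE = 1.602e-19  # C (rounded value, assumed)


def threshold_gain(loss_coefficient, cavity_length, r1, r2):
    """Threshold gain: internal loss plus mirror loss, all in m^-1."""
    mirror_loss = math.log(1.0 / (r1 * r2)) / (2.0 * cavity_length)
    return loss_coefficient + mirror_loss


def threshold_current_density(wavelength, linewidth, refractive_index,
                              mode_thickness, internal_efficiency, k_th):
    """Threshold current density in A m^-2, using nu0 / c = 1 / wavelength."""
    numerator = (8.0 * math.pi * ELECTRON_CHARGE * mode_thickness
                 * linewidth * refractive_index ** 2 * k_th)
    return numerator / (internal_efficiency * wavelength ** 2)


# Data from Example 5.5 (SI units).
k_th = threshold_gain(3.5e3, 300e-6, 0.32, 0.32)
j_th = threshold_current_density(0.84e-6, 1.45e13, 3.6, 2e-6, 1.0, k_th)

print(f"k_th = {k_th:.0f} m^-1")
print(f"J_th = {j_th:.3g} A m^-2 = {j_th / 1e6:.3g} A mm^-2")
```

The threshold gain agrees with the value in the example, and the current density comes out within a few per cent of the quoted figure; the small difference depends on how the constants are rounded.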

5.10.2.2 Power output of semiconductor lasers

A discussion of power output and saturation in semiconducting lasers is basically the same
as that for other lasers given in section 5.8. As the injection current increases above
threshold, laser oscillations build up and the resulting stimulated emission reduces the population
inversion until it is clamped at the threshold value. We can then express the power emitted
by stimulated emission as

$$P = A\,[J - (J)_{\mathrm{th}}]\,\frac{\eta_i h\nu}{e}$$

where A is the junction area.

Part of this power is dissipated inside the laser cavity and the rest is coupled out via the
end crystal faces. These two components are proportional to γ and (1/2l) ln(1/R₁R₂)
respectively. Hence we can write the output power as

$$P_o = \frac{A\,[J - (J)_{\mathrm{th}}]\,\eta_i h\nu}{e}\;\frac{(1/2l)\ln(1/R_1R_2)}{\gamma + (1/2l)\ln(1/R_1R_2)} \qquad (5.40)$$

The external differential quantum efficiency η_ex is defined as the ratio of the increase in
photon output rate to the increase in the injection rate (i.e. carriers per second) that produces it,
that is

$$\eta_{\mathrm{ex}} = \frac{\mathrm{d}(P_o/h\nu)}{\mathrm{d}\{(A/e)[J - (J)_{\mathrm{th}}]\}}$$

From eq. (5.40) we can write η_ex as

$$\eta_{\mathrm{ex}} = \eta_i\left(\frac{\ln(1/R_1)}{\gamma l + \ln(1/R_1)}\right) \qquad (5.41)$$

assuming that R₁ = R₂. Equation (5.41) enables us to determine the internal quantum
efficiency from the experimentally measured dependence of η_ex on l; η_i in GaAs is usually in
the range 0.7-1.0.

Now if the forward bias voltage applied to the laser is V_f, then the power input is V_f AJ
and the efficiency of the laser in converting electrical input to laser output is

$$\eta = \frac{P_o}{V_f AJ} = \eta_i\left(\frac{J - (J)_{\mathrm{th}}}{J}\right)\left(\frac{h\nu}{eV_f}\right)\frac{\ln(1/R_1)}{\gamma l + \ln(1/R_1)} \qquad (5.42)$$

From Fig. 5.18, eV_f ≈ hν and therefore, well above threshold (J ≫ (J)_th), where optimum
coupling ensures that (1/l) ln(1/R₁) ≫ γ, η approaches η_i. As noted above, η_i is high (≈0.7) and
thus semiconductor lasers have a very high power efficiency.
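
The behaviour of eqs (5.41) and (5.42) is easy to explore numerically. The following sketch is illustrative: the function names are hypothetical, the cavity data are borrowed from Example 5.5, and the drive level (ten times threshold) and the equality of photon energy and eV_f are assumptions made only for the demonstration.

```python
import math


def external_quantum_efficiency(eta_i, loss_coefficient, length, reflectance):
    """External differential quantum efficiency for equal facet reflectances."""
    mirror_term = math.log(1.0 / reflectance)
    return eta_i * mirror_term / (loss_coefficient * length + mirror_term)


def power_efficiency(eta_i, loss_coefficient, length, reflectance,
                     current_density, threshold_density,
                     photon_energy_ev, forward_voltage):
    """Electrical-to-optical efficiency; h*nu/(e*V_f) is photon_energy_ev / V_f."""
    eta_ex = external_quantum_efficiency(eta_i, loss_coefficient,
                                         length, reflectance)
    above_threshold = (current_density - threshold_density) / current_density
    return eta_ex * above_threshold * photon_energy_ev / forward_voltage


# Cavity data from Example 5.5; eta_i = 1 as in that example.
eta_ex = external_quantum_efficiency(1.0, 3.5e3, 300e-6, 0.32)
# Assumed operating point: J = 10 * J_th and e*V_f equal to the photon energy.
eta = power_efficiency(1.0, 3.5e3, 300e-6, 0.32, 10.0, 1.0, 1.5, 1.5)

print(f"eta_ex = {eta_ex:.2f}")
print(f"eta    = {eta:.2f}")
```

Shortening the cavity (smaller l) or reducing γ moves η_ex towards η_i, which is the limit described in the text.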
5.10.2.3 Heterojunction lasers

As we noted above, the threshold current density for homojunction lasers is very large owing
to poor optical and carrier confinement, which results in the parameters d and γ in eq. (5.39)
being large. Dramatic reductions in the threshold current density to values of the order of
10 A mm⁻² at room temperature coupled with higher efficiency can be achieved using lasers
containing heterojunctions (see section 2.8.6 and ref. 5.16). The properties of heterostructure
lasers which permit a low threshold current density and CW operation at room temperature
can be illustrated with the double heterostructure (DH) laser illustrated in Fig. 5.27.
In this structure, a layer of GaAs, for example, is sandwiched between two layers of the
ternary compound Ga_{1-x}Al_xAs which has a wider energy gap than GaAs and also a lower
refractive index. Both N-n-P and N-p-P structures show the same behaviour (where N
and P represent the wider bandgap semiconductor, according to carrier type).

Figure 5.27(b) also shows that carrier and optical confinement may be achieved simul- 
taneously. The bandgap differences form potential barriers in both the conduction and valence 
bands which prevent electrons and holes injected into the GaAs layer from diffusing away. 
The GaAs layer thus becomes the active region, and it can be made very narrow so that t is 
very small, typically about 0.2 µm. Similarly, the step change in refractive index provides
a very much more efficient waveguide structure than was the case in homojunction lasers. 
The radiation is therefore confined mainly to the active region. In addition, the fraction of 
the propagating mode which lies outside the active region is in a wider bandgap semicon- 
ductor and is therefore not absorbed, so that y is much smaller than in homojunction lasers. 

Further reductions in threshold current can be obtained by restricting the current along 
the junction plane into a narrow 'stripe' which may only be a few micrometres wide. Such 
stripe geometry lasers have been prepared in a variety of different ways; typical examples 
are shown in Fig. 5.28. In Fig. 5.28(a), the stripe has been defined by proton bombardment 
of the adjacent regions to form highly resistive material, whereas in Fig. 5.28(b) a mesa struc- 
ture has been formed by etching; an oxide mask prevents shorting of the junction during met- 
allization to form contacts. With stripe geometry structures, operating currents of less than 
50 mA can produce output powers of about 10 mW. 

Stripe geometry devices have further advantages including the facts that (a) the radiation 
is emitted from a small area which simplifies the coupling of the radiation into optical fibers 
(see Chapter 9) and (b) the output is more stable than in other lasers. A close examination 
of typical light output-current characteristics reveals the presence of 'kinks' as shown in
Fig. 5.29(a). These 'kinks' are associated with a sideways displacement of the radiating
filament within the active region (the radiation is usually produced from narrow filaments within
the active region rather than uniformly from the whole active region). This lateral instabil- 
ity is caused by interaction between the optical and carrier distributions which arises because 
the refractive index profile, and hence the waveguiding characteristic, is determined, to a 
certain extent, by the carrier distribution within the active region. The use of very narrow 
stripe regions limits the possible movement of the radiating filament and eliminates the 
'kinks' in the light output-current characteristics as shown in Fig. 5.29(b). The structures
shown in Fig. 5.28 are referred to as gain guiding because the width of the gain region is
determined by the restriction of the extent of the current flow, which of course creates the
population inversion, and hence the gain, within the active region. Alternatively stripe
geometry lasers can be fabricated using index-guided structures, in which an optical waveguide
is created (section 8.2) as illustrated in Fig. 5.30(a). Here a buried heterostructure has been
formed, in which the active region is surrounded on all four sides by material of lower
refractive index. The creation of such structures in practice is quite complex; a relatively simple
one is shown in Fig. 5.31(a). One relatively straightforward alternative is to change the
thickness of the semiconductor layer next to the waveguide (Fig. 5.30b) which creates an
effective refractive index difference between the active region and those next to it in the same
layer. A device based on this technique is shown in Fig. 5.31(b). Several other buried layer
heterostructure devices are described in ref. 5.17.

FIG. 5.27 Diagram illustrating the action of single (a) and double (b) heterojunction structures in confining
the carriers and radiation to the gain region (as before, in the diagrams of the energy bands, the dashed
lines represent the Fermi levels after forward bias has been applied).

FIG. 5.28 Schematic cross-section (end view) of two typical stripe geometry laser diodes: (a) the stripe is
defined by proton bombardment of selected regions to form high resistivity material; (b) the stripe is
formed by etching a mesa structure and then GaAlAs is grown into the previously etched outsides of the
active region to form a 'buried stripe' structure.

FIG. 5.29 Light output-current characteristics of (a) a laser showing a lateral instability or 'kink' and (b)
stripe lasers, in which the 'kinks' have disappeared.

FIG. 5.30 Schematic representation of (a) a buried heterostructure which acts as a waveguide (end view),
and (b) a structure which behaves like a buried heterostructure; the varying thickness of the layer next to
the guiding layer creates changes in the apparent refractive index, thereby achieving a waveguiding
structure.

FIG. 5.31 Buried heterostructure index guiding laser structures: (a) based on InGaAsP (and the structure
shown in Fig. 5.30a); (b) based on GaAs (and the structure shown in Fig. 5.30b).

In general gain-guided lasers are easier to fabricate than index-guided lasers, but their 
poorer optical confinement limits the beam quality, and makes stable, single mode opera- 
tion difficult to achieve. On the other hand the fact that the beam spread is greater reduces 
the optical power density at the output face thereby reducing the risk of damage (see below). 

Although the application of diode lasers to fiber optical communications will be discussed 
in detail in Chapter 9 it is perhaps appropriate to mention at this stage one or two points of 
relevance. These include the temperature dependence of the threshold current, output beam 
spread, degradation and the use of materials other than GaAlAs. 

The threshold current density J_th increases with temperature in all types of semiconductor
laser but, as many factors contribute to the temperature variation, no single expression
is valid for all devices and temperature ranges. Above room temperature, which is usually
the region of practical interest, it is found that the ratio of J_th at 70°C to J_th at 22°C for GaAlAs
lasers is about 1.3-1.5 with the lowest temperature dependence occurring for an aluminium
concentration such that the bandgap energy difference is 0.4 eV. Typical light output-current
characteristics for a GaAlAs DH laser are shown in Fig. 5.32.

The angular spread of the output beam depends on the dimensions of the active region
and the number of oscillating modes (which, in turn, depends on the dimensions of the active
region, the refractive index and the pump power). For wide active regions, we find that the
beam divergence both parallel to (θ∥) and perpendicular to (θ⊥) the plane of the junction is
given approximately by simple diffraction theory. Thus, normal to the junction plane we have
θ⊥ ≈ 1.22λ/d. For DH lasers, where the active region is much narrower, θ⊥ is given
approximately by θ⊥ ≈ 1.1 × 10³ x(t/λ) degrees, where x is the mole fraction of aluminium. Thus
for a DH laser with t = 0.1 µm, x = 0.3 and λ = 0.9 µm, we find θ⊥ ≈ 37° (in good agreement with
experimental observations).
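
The two divergence estimates can be compared with a short calculation. In this sketch the function names are hypothetical; the diffraction formula is taken to give an angle in radians, while the empirical DH expression is taken to give degrees directly, an assumption that is consistent with the 37° quoted above. The 2 µm mode thickness used for the diffraction case is borrowed from Example 5.5 purely for illustration.

```python
import math


def diffraction_divergence_deg(wavelength, aperture_thickness):
    """Diffraction estimate, 1.22 * lambda / d, converted from radians to degrees."""
    return math.degrees(1.22 * wavelength / aperture_thickness)


def dh_divergence_deg(aluminium_fraction, active_thickness, wavelength):
    """Empirical DH-laser estimate; the coefficient is assumed to yield degrees."""
    return 1.1e3 * aluminium_fraction * active_thickness / wavelength


# DH laser values from the text: t = 0.1 um, x = 0.3, lambda = 0.9 um.
theta_dh = dh_divergence_deg(0.3, 0.1e-6, 0.9e-6)
# Diffraction estimate for an assumed 2 um thick emitting region.
theta_diffraction = diffraction_divergence_deg(0.9e-6, 2e-6)

print(f"DH laser estimate:    {theta_dh:.0f} degrees")
print(f"diffraction estimate: {theta_diffraction:.0f} degrees")
```

Because both thickness and wavelength enter only as a ratio, any consistent length unit may be used.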

Until recently, the system Ga_{1-x}Al_xAs/GaAs was the most widely investigated and used
for the production of DH lasers. There are many reasons for this, including the facts that (a)
GaAs is a direct bandgap semiconductor which can easily be doped n- or p-type; (b) the
ternary compound Ga_{1-x}Al_xAs can be grown over a wide range of compositions, and not
only does it have a very close lattice match to GaAs (≈0.1%) for all values of x (thus there
is low interfacial strain between adjacent layers and consequently very few strain-induced
defects at which non-radiative recombination may occur), but it is also a direct bandgap
semiconductor for x < 0.45; and (c) the relative refractive indices and bandgaps of GaAs and
GaAlAs provide for optical and carrier confinement.

In optical fiber communications, however, it is desirable to have a laser emitting at
wavelengths in the region 1.1 to 1.6 µm where present optical fibers have minimum attenuation
and dispersion. Wavelengths in this range can be obtained from lasers fabricated from
quaternary compounds such as Ga_xIn_{1-x}As_{1-y}P_y because of the wide range of bandgaps and
lattice constants spanned by this alloy. Figure 5.33 shows the lattice constant variation with
bandgap (and emission wavelength) for this alloy. By suitable choice of x and y, exact lattice
matching to an InP substrate can be achieved and strain-free heterojunction devices can be
produced. The GaInAsP layers may be grown on InP substrates by liquid phase, vapour phase
or molecular beam epitaxial methods (see ref. 5.18). A typical DH stripe contact laser diode
of GaInAsP/InP emitting at 1.1-1.3 µm is shown schematically in Fig. 5.34.
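
The link between bandgap and emission wavelength follows from the relation ν = E_g/h used earlier, so that λ = hc/E_g. The sketch below is a minimal illustration with hypothetical function names and rounded physical constants; it converts the 1.1-1.6 µm window into the range of bandgaps the alloy must provide.

```python
PLANCK = 6.626e-34           # J s (rounded)
SPEED_OF_LIGHT = 2.998e8     # m s^-1 (rounded)
ELECTRON_CHARGE = 1.602e-19  # C (rounded)


def emission_wavelength_um(bandgap_ev):
    """Emission wavelength in micrometres for a bandgap given in eV."""
    energy_joules = bandgap_ev * ELECTRON_CHARGE
    return PLANCK * SPEED_OF_LIGHT / energy_joules * 1e6


def bandgap_for_wavelength_ev(wavelength_um):
    """Bandgap in eV needed for emission at the given wavelength in micrometres."""
    energy_joules = PLANCK * SPEED_OF_LIGHT / (wavelength_um * 1e-6)
    return energy_joules / ELECTRON_CHARGE


# Fiber window quoted in the text: 1.1 to 1.6 um.
for wavelength in (1.1, 1.3, 1.6):
    gap = bandgap_for_wavelength_ev(wavelength)
    print(f"{wavelength:.1f} um  ->  E_g = {gap:.2f} eV")
```

Longer wavelengths therefore call for narrower bandgaps, which is what the composition variables x and y are adjusted to provide.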

The question of laser reliability is also important in relation to applications such as telecom- 
munications. Laser life may be limited by 'catastrophic' or 'gradual' degradation. 
Catastrophic failure results from mechanical damage to the laser facets due to too great an 
optical flux density. The damage threshold is reduced by the presence of flaws on the facets; 
however, it may be increased by the application of half-wave coatings of materials such as 
Al₂O₃. While facet damage is more likely in lasers operating in the pulse mode, it can also
occur in CW-operated lasers. This is so especially in the central portion of the active region
of stripe lasers where the optical flux density is greatest. Uncoated lasers with stripes about
20 µm wide tend to fail catastrophically when the optical flux exceeds about 10⁹ W m⁻².

Gradual degradation depends principally on the current density, but also on the duty cycle 
and fabrication process. It has been observed that as time elapses the threshold current density 
increases, 'dark' lines develop in the emission and then the CW output falls off drastically. 

The development of dark lines is apparently related to the formation, in the vicinity of 
the active region, of so-called dark-line defects, which act as non-radiative recombination 
centres. Dark-line defects are attributed to defects such as dislocations, which may have a 
number of sources. These include (a) edge dislocations formed to relieve stress caused by 
interfacial lattice mismatch, (b) bonding of the laser to the heat sink and (c) impurities intro- 
duced during substrate preparation and heteroepitaxial wafer growth. Furthermore, the energy 
released by non-radiative electron-hole recombination may result in the creation and migra- 
tion of defects. 

Defects may be formed in the active region during device fabrication or penetrate into it 
during subsequent operation. Dark-line defects may grow owing to a process called dislo- 
cation climb (i.e. the movement of dislocations involving atomic transport to or away from 
the dislocation) and extend throughout the device structure.

Dislocation growth may be stimulated by carrier injection and recombination. GaAs lasers 
initially containing dislocations are found to degrade at a much higher rate than those that
are initially dislocation free. Furthermore, devices with exposed edges that contain edge 
defects also degrade more rapidly than those in which recombination is restricted to inter- 
nal regions of the crystal. 

Thus, to produce lasers with long lifetimes great care must be taken with substrate selection,
and wafer processing and crystal growth must be carried out under ultraclean conditions
to fabricate a laser with a strain-free structure. Despite these problems, lasers with lifetimes
in excess of 40 000 hours are now available, corresponding to continuous operation
over a 5 year period.

5.10.2.4 Quantum well lasers

Quantum well structures were discussed in section 2.9, where it was shown that in very 
narrow semiconductor layers (i.e. the quantum wells) there is a very significant increase 
in the density of states near the bottom of the conduction band and the top of the valence 
band. The increased densities of states enable a population inversion to be obtained more 
easily and, as a consequence of this, and the very small active volume, the threshold currents 
in quantum well lasers are about a factor of 10 less than those in DH lasers. In addition, 
quantum well lasers have low temperature sensitivity and their output characteristics are 
free from kinks. Such lasers are therefore increasingly replacing DH lasers as materials 
growth technology improves enabling the controlled fabrication of very thin structures in 
an increasingly wide range of semiconductors (ref. 5.19). 
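
The quantization that underlies these effects can be illustrated with the simplest possible model. The sketch below assumes an infinitely deep one-dimensional well, for which the confinement energies are E_n = n²h²/(8m*L²); the well width of 10 nm and the electron effective mass ratio of 0.067 (a value commonly quoted for GaAs) are assumptions chosen for illustration, and real wells of finite depth give somewhat lower energies.

```python
# Confinement energies of an infinitely deep quantum well (idealized model).
H = 6.62607015e-34    # Planck constant, J s
M0 = 9.1093837e-31    # free electron mass, kg
Q = 1.602176634e-19   # joules per electron volt

def confinement_energy_ev(n, well_width_nm, eff_mass_ratio):
    """Energy of level n above the band edge, E_n = n^2 h^2 / (8 m* L^2), in eV."""
    width_m = well_width_nm * 1e-9
    mass = eff_mass_ratio * M0
    return (n * H) ** 2 / (8.0 * mass * width_m ** 2) / Q

# Assumed illustrative values: 10 nm well, m*/m0 = 0.067.
for level in (1, 2):
    e_mev = 1e3 * confinement_energy_ev(level, 10.0, 0.067)
    print(f"n = {level}: about {e_mev:.0f} meV above the conduction band edge")
```

With these assumed values the lowest level lies some 56 meV above the band edge, and because the energy varies as 1/L² a narrower well shifts the emission to shorter wavelengths.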

One of the problems with the single quantum well (SQW) structure described above is 
that, because of the extreme narrowness of the active region, optical confinement is very 
poor. This causes higher losses and tends to negate the potential advantages of low thresh- 
old currents. One way of reducing these problems is to use the multiple quantum well (MQW) 
structure illustrated in Fig. 5.35(b), which because of its greater thickness gives better optical 
confinement and beam definition. 

The single quantum well can be extended to coupled quantum wells, to form the MQW 
laser, Figs 5.35(a) and (b). In such devices several GaAs quantum wells, for example, may 
be coupled by very thin intervening GaAlAs barrier layers. The overall active region is now 
thicker so that the carriers which are not captured and therefore able to recombine in the first 
well may be captured by the second or a subsequent well. Although MQW lasers have larger 
threshold currents than single quantum well lasers, where the threshold current may be as 
low as 1 mA or less, they can emit more optical power, and their structure results in better 
optical confinement. 

Further improvement in both optical and carrier confinement can be obtained by adding 
cladding layers and separate confinement heterostructure (SCH) layers as illustrated in Fig. 
5.35(b) (ref. 5.19b). The SCH layers are chosen to have a refractive index which is greater 
than that of the cladding layers, so that total internal reflection occurs at the boundary. The 
SCH layers also, together with the barrier layers, have an energy gap, E_g, between that of
the cladding layers and quantum wells, so that the charge carriers are confined between the 
cladding regions - hence the SCH layers are so named because the carriers and photons are 
separately confined. 

The cladding layers are doped n- and p-type, while the MQW layers are undoped. Under 
forward bias the electrons and holes are injected from the cladding layers, diffuse across the 
SCH layers and enter the MQW structure where they recombine. The cavity mirrors are pro- 
vided by the high reflectance of the device faces. Alternatively multilayer coatings can be 
added or the techniques described in section 6.2 can be used. 

FIG. 5.35 (a) Stimulated emission in a single quantum well. Some of the carriers escape (thin arrows)
without contributing to the process. (b) The energy band diagram for a typical multiple quantum well
(MQW) laser with separate confinement heterostructure (SCH) layers. The light waves are transversely
guided between the cladding layers, either by changes in the refractive index (index guiding), or by current
injection from a stripe contact (gain guiding). By using InGaAsP, InGaAs or InGaAlAs semiconductor
systems grown on InP substrates the wavelength range from 920 nm to 1650 nm can be covered.

The lasing region of the active layer can be restricted to a narrow strip thereby in effect
confining the carriers in two dimensions. Such structures are referred to as quantum wire
microcavities, and are the basis of QWR-MC lasers. Further restriction, that is into three
dimensions, gives rise to quantum dot lasers. Despite manufacturing difficulties quantum
wire and quantum dot arrays are potentially important because, in addition to very low thresh- 
old currents, they have very high modulation bandwidths, narrow spectral linewidths and 
low temperature sensitivity. 

5.10.2.5 Arrays - vertical cavity lasers

FIG. 5.36 A linear array of lasing elements formed within a single semiconductor bar: (a) shows the stripe
contacts, which define the lasing regions; (b) shows the electric field distribution; the field is zero midway
between the lasing elements where there is absorption rather than gain.

The output power from semiconductor lasers may be increased by using one-dimensional
arrays of single mode lasers on a bar of semiconductor as shown in Fig. 5.36(a). Such arrays
are called phased arrays since the electric fields associated with the individual elements interact
with each other resulting in definite phase relationships between them. Frequently the
phase difference between adjacent elements is arranged to be 180° (Fig. 5.36b), so that the 
resultant field midway between the active regions of adjacent elements is zero. These midway 
regions are more likely to exhibit absorption rather than gain so the overall losses are min- 
imized. Unfortunately, this phase relationship results in a power distribution in the plane of 
the active layer with an angular distribution comprising two lobes rather than a single one. 
In fact a single-lobed power output distribution can be achieved if the phase between adja- 
cent elements is zero. The phase difference can be controlled by a number of techniques 
including variation of the lateral spacing between the elements in the array (ref. 5.20). 
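
The difference between the two-lobed and single-lobed outputs can be seen from a simple array-factor calculation. The sketch below is only illustrative: it assumes identical point-like emitters of equal amplitude, ignores the radiation pattern of the individual elements, and uses assumed values of 10 elements, 10 µm spacing and a wavelength of 0.85 µm.

```python
import cmath
import math

def array_intensity(theta_deg, n_elements, spacing_um, wavelength_um, phase_step_rad):
    """Relative far-field intensity of a uniform linear array at angle theta.

    phase_step_rad is the phase difference between adjacent elements.
    """
    k = 2.0 * math.pi / wavelength_um
    theta = math.radians(theta_deg)
    per_element = k * spacing_um * math.sin(theta) + phase_step_rad
    field = sum(cmath.exp(1j * m * per_element) for m in range(n_elements))
    return abs(field) ** 2

# Assumed illustrative array: 10 elements, 10 um apart, emitting at 0.85 um.
N, D, LAM = 10, 10.0, 0.85
lobe_angle = math.degrees(math.asin(LAM / (2.0 * D)))  # where the out-of-phase lobes peak

print("in phase, on axis        :", round(array_intensity(0.0, N, D, LAM, 0.0), 3))
print("180 deg, on axis         :", round(array_intensity(0.0, N, D, LAM, math.pi), 3))
print("180 deg, at lobe angle   :", round(array_intensity(lobe_angle, N, D, LAM, math.pi), 3))
print("lobe angle (degrees)     :", round(lobe_angle, 2))
```

In this model the in-phase array radiates its maximum on axis, whereas a 180° phase difference gives zero intensity on axis and two equal lobes at sin θ = ±λ/2d.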

Linear arrays are available in widths up to 10 mm and can generate CW powers up to 20 W. 
Outputs of 10 kW or more can be achieved by stacking up to 200 linear array bars together. 
It is important to realize that as the power output increases, so too do the cooling require- 
ments; it is therefore vital to consider carefully how to remove excess heat to prevent the 
array from self-destructing. Very high power arrays, for example, require water cooling. 

VERTICAL CAVITY LASERS 

A structure which particularly lends itself to the fabrication of laser arrays is the vertical 
cavity surface-emitting laser (VCSEL) (ref. 5.20). While in traditional, horizontal edge-emitting
lasers the resonant cavity is in the plane of the active layer, in VCSELs (Fig. 5.37) it is
perpendicular to this plane. The light resonates between mirrors on the top and bottom of
the laser wafer so that the photons pass through only a very short length (typically < 1 µm)
of active medium, in which they can stimulate emission. Thus VCSELs have very much lower
round trip gain than horizontal edge-emitting lasers, and consequently require highly reflecting
mirrors (reflectance >0.9) to sustain oscillations. Clearly the reflectance of the semiconductor
facets at about 0.32 is insufficient and multilayer mirrors comprising several tens
of alternate λ/4 coatings of AlAs and AlGaAs are often used as illustrated in Fig. 5.37
(ref. 5.20).
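
The need for highly reflecting mirrors can be made quantitative with the usual round-trip threshold condition, R1 R2 exp[2(g - α)L] = 1, which gives a mirror loss of (1/2L) ln(1/R1R2) that the gain must overcome. The sketch below is a rough estimate only: it assumes a refractive index of 3.6 for the semiconductor, an edge-emitter length of 300 µm, and that the gain acts over the whole of the stated length; internal losses α are ignored.

```python
import math

def facet_reflectance(n_semiconductor, n_outside=1.0):
    """Normal-incidence Fresnel reflectance of a cleaved facet."""
    return ((n_semiconductor - n_outside) / (n_semiconductor + n_outside)) ** 2

def mirror_loss_per_cm(gain_length_um, r1, r2):
    """Gain coefficient (cm^-1) needed to balance the mirror losses alone."""
    length_cm = gain_length_um * 1e-4
    return math.log(1.0 / (r1 * r2)) / (2.0 * length_cm)

r_facet = facet_reflectance(3.6)   # assumed index, gives about 0.32
print(f"cleaved facet reflectance     : {r_facet:.3f}")
print(f"edge emitter, 300 um, R = 0.32: {mirror_loss_per_cm(300.0, 0.32, 0.32):.0f} cm^-1")
print(f"vertical cavity, 1 um, R = 0.32: {mirror_loss_per_cm(1.0, 0.32, 0.32):.0f} cm^-1")
print(f"vertical cavity, 1 um, R = 0.99: {mirror_loss_per_cm(1.0, 0.99, 0.99):.0f} cm^-1")
```

With cleaved-facet reflectances a 1 µm gain length would need a gain of order 10⁴ cm⁻¹, whereas mirrors of reflectance 0.99 bring the requirement down to about 100 cm⁻¹, comparable with that of the much longer edge emitter.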

The active layer comprises an SQW or MQW structure, which together with cladding and
confinement layers forms an optical cavity which is one wavelength thick. The active region 
is arranged to be at the peak of the standing wave formed between the mirrors. 

All vertical cavity lasers emit from their surface rather than their edge (though surface- 
emitting lasers are available which do not have vertical cavities). The emission is typically 
from round or square areas which are about 1 µm wide so that the output beams are highly
symmetrical in contrast to those of edge-emitting devices. Divergence angles are only 7° to
10°, and by using microlenses integrated onto the device surface some 90% of the output
may be coupled into optical fibers. 

In addition to the symmetrical beam profile, low threshold currents and good tempera- 
ture stability of VCSELs, a major attraction of surface emission is the ability to fabricate 
monolithic one- and two-dimensional arrays of many elements. In practice it is possible to 
grow many thousands of VCSELs simultaneously on a 3 inch (75 mm) wafer and, equally 
importantly in relation to manufacturing costs, to test these and measure the optical and elec- 
trical properties in situ. 

A range of one-dimensional (up to 64 elements) and two-dimensional (8 x 8) VCSEL 
arrays is now commercially available, with much larger arrays under development. Each laser
in the array can be independently addressed so that, for example, the lasers in an array can
act as sources for several parallel communication channels (ref. 5.20b), particularly as vertical
cavity lasers have very high modulation bandwidths.

VCSELs are currently available in the wavelength range 650-690 nm using GaAs/GaAlAs
and 750-980 nm using InGaAs/GaAs semiconductor systems. Unfortunately efforts to fabricate
VCSELs operating CW at room temperature in the wavelength range 1300-1550 nm,
which is so important for long range fiber optic communications, have not yet succeeded. 

5.10.2.6 Short wavelength lasers 

Recently there has been increased demand for shorter wavelength semiconductor lasers for 
applications such as compact disc and optical storage, colour printing and semiconductor 
lithography. The shorter the wavelength the smaller is the area of a focused beam (~λ²),
thereby allowing increased storage capacity, and similarly the narrower the features that can
be created with optical lithography. 
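
The λ² scaling of the focused spot area translates directly into storage density. As a minimal sketch, assuming the same focusing optics at each wavelength and taking 780 nm as a reference (a wavelength typical of infrared compact disc lasers, used here only as an assumed baseline):

```python
def relative_spot_area(wavelength_nm, reference_nm):
    """Focused spot area relative to the reference, assuming area ~ wavelength^2."""
    return (wavelength_nm / reference_nm) ** 2

def relative_storage_density(wavelength_nm, reference_nm):
    """Bits per unit area relative to the reference (inverse of the spot area)."""
    return 1.0 / relative_spot_area(wavelength_nm, reference_nm)

REFERENCE_NM = 780.0   # assumed infrared baseline, not a value from the text
for lam in (630.0, 525.0, 417.0):
    gain = relative_storage_density(lam, REFERENCE_NM)
    print(f"{lam:.0f} nm: storage density about {gain:.1f} times that at 780 nm")
```

On this simple scaling, moving from 780 nm to 417 nm would increase the storage density by a factor of about 3.5.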

Recently red lasers based on AlGaInP have become available for use in barcode readers,
while quantum well lasers with GaInP active layers have enabled wavelengths as short as
630 nm to be generated. 

Despite the improved reliability of semiconductor lasers emitting in the red and yellow 
parts of the spectrum, reliable lasers emitting in the green and blue remain elusive. However, 
recent improvements in materials technology have enabled CW, room temperature operation
to be demonstrated in so-called II-VI semiconductors such as ZnSe, ZnMgSSe and
related compounds on GaAs substrates. Alternatively CW laser operation at a wavelength
of 417 nm has been obtained from devices based on gallium nitride (GaN) which is a rather
difficult material to work with (ref. 5.21a). These lasers contain an MQW structure of 26
quantum wells 2.5 nm thick of In_{0.2}Ga_{0.8}N separated by In_{0.05}Ga_{0.95}N barriers 5.0 nm
thick giving a total thickness of some 200 nm. The threshold current densities and operating
voltages are still rather high at about 10 kA cm⁻² and 25 V respectively, but these are
being steadily reduced as the technology develops. 
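
The figures quoted for the GaN device can be checked with some simple arithmetic. The sketch below assumes one barrier per well when adding up the layer thicknesses, and takes the electrical input power density at threshold as the product of the threshold current density and the operating voltage.

```python
def mqw_thickness_nm(n_periods, well_nm, barrier_nm):
    """Total thickness of an MQW stack, assuming one barrier per well."""
    return n_periods * (well_nm + barrier_nm)

def input_power_density_kw_per_cm2(current_density_ka_per_cm2, voltage_v):
    """Electrical input power density: kA cm^-2 times V gives kW cm^-2."""
    return current_density_ka_per_cm2 * voltage_v

print("MQW thickness :", mqw_thickness_nm(26, 2.5, 5.0), "nm")
print("input power   :", input_power_density_kw_per_cm2(10.0, 25.0), "kW cm^-2")
```

The stack comes to 195 nm, in agreement with the 'some 200 nm' quoted, while an input of 250 kW cm⁻² at threshold indicates why reducing both the current density and the voltage is important.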

The requirement of close lattice matching (i.e. ≈0.1%) for the components in a heterojunction
structure made it difficult to cover some wavelength ranges. A recent development
which has helped in this respect is the discovery that very thin layers (less than a few tens
of nanometres) can accommodate a lattice mismatch of more than 1%. These layers are called
strained lattice layers, and the technique was first used to enable the fabrication of 
InGaAs/GaAs lasers emitting at 980 nm. Strained layers are also used in quantum well struc- 
tures to produce active layers which need not be precisely matched to the surrounding layers. 
This technique was used to produce the lasers based on ZnSe, which emit in the green at a 
wavelength of 525 nm (ref. 5.21b), and to enable GaN to be grown on mismatched substrates 
such as sapphire, which has the same crystal structure, or silicon nitride. 

5.10.2.7 Superluminescent light-emitting diodes

We end this section with a discussion of a device which, while not a laser, does depend on 
optical amplification, namely the superluminescent light-emitting diode (SLD). SLDs have 
a structure which is rather similar to that of conventional injection laser diodes and edge-emitting
LEDs (ref. 5.22); indeed the SLD has optical properties which are intermediate
between these two devices. Both stripe geometry and buried heterostructure SLDs are available,
emitting at a range of wavelengths. In contrast to laser diodes, however, the non-output
end of the device is made optically lossy to minimize feedback and suppress laser oscillations.
This can be achieved simply by roughening the cleaved surface of the device to scatter
the light, or by adding an antireflection coating. 

In operation the injection current is increased until stimulated emission, and hence ampli- 
fication of spontaneous emission, just occurs. That is, operation is on the 'knee' of the laser 
diode output characteristic shown in Fig. 5.25. Although there are no oscillations the stim- 
ulated emission, within a single pass through the device, provides gain so that the device 
output increases rapidly with increase in current - this is termed superradiance or superluminescence.
High optical output power can be obtained together with a narrowing of the spec-
tral width, which also results from the stimulated emission. 

These characteristics of the output from SLDs give a number of advantages over con- 
ventional LEDs in relation to their use in fiber optic communications. These include: higher 
power outputs (up to 60-100 mW), a more directional light beam, and a narrower spectral 
linewidth, all of which improve the source to fiber coupling. Moreover, the superradiant emis- 
sion process within SLDs tends to increase their modulation bandwidth. In contrast to con- 
ventional LEDs, however, SLDs suffer from having a non-linear output characteristic and 
an increased temperature dependence of the output power. Compared with laser diodes they 
require substantially higher injection currents (by a factor of about three) to produce a similar 
power output. 

5.10.3 Gas lasers 

Gas lasers are the most widely used type of laser; they range from the low power 
helium-neon (He-Ne) laser commonly found in teaching laboratories to very high power 
carbon dioxide lasers, which have many industrial applications. Basically, there are three 
different classes of gas laser, according to whether the transitions are between the electronic 
energy levels of atoms, or of ions, or between the vibrational/rotational levels of molecules. 
In general, the energy levels involved in the lasing process are well defined and the absence
of broad bands effectively eliminates the possibility of optical pumping. Though other 
methods can be used, most gas lasers are excited by electron collisions in a gas discharge. 
We shall now consider a typical example from each of those classes mentioned above. 

5.10.3.1 Atomic lasers - the He-Ne laser

In the He-Ne laser the active medium is a mixture of about 10 parts of helium to one part
of neon. The neon provides the energy levels for the laser transitions (about 150 different
laser transitions have been observed, although only the four shown in Fig. 5.38 are reasonably
strong), while the helium atoms, though not directly involved in the laser transitions,
have an important role in providing an efficient excitation mechanism for the neon atoms.
Excitation usually takes place in a d.c. discharge created by applying a high voltage (≈ 2 to
4 kV) across the gas contained in a narrow diameter glass tube at a pressure of about 10 torr,
as illustrated in Fig. 5.39. As the discharge tube exhibits a negative dynamic resistance when
a discharge has been initiated, it is necessary to include a load resistor to limit the current
and protect the power supply.

FIG. 5.38 Energy levels relevant to the operation of the He-Ne laser. M indicates a metastable state.

The pumping process can be described as follows. The first step is the electron impact 
excitation of helium atoms to one of two metastable states designated 2¹S and 2³S; this is
represented by

e₁ + He = He* + e₂

where e₁ and e₂ are the electron energies before and after the collision. While in one of their
excited states (He*), helium atoms can transfer their energy to neon atoms, with which they
may collide. The probability of this resonant transfer of energy is proportional to
exp(-ΔE/kT) where ΔE is the energy difference between the excited states of the two atoms
involved. The energy level diagram for helium and neon is shown in Fig. 5.38. Here, the
neon states are labelled by the so-called Paschen notation (the numerical subscripts have no
spectroscopic significance and are used only as labels) while the helium states are labelled
according to the L-S representation mentioned in section 5.10.1. Figure 5.38 shows that there
is a group of four neon levels at almost the same energies as each of the two excited helium
states, and resonant transfer thus occurs quite readily. The energy transfer is represented by

He* + Ne = Ne* + He

A population inversion is thus created between the 3s and (3p, 2p) groups of levels and also 
between the 2s and 2p levels. Transitions between the 3s and 2s levels and between the 3p 
and 2p levels are forbidden by quantum mechanical selection rules. 
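
The exp(-ΔE/kT) dependence of the resonant transfer probability shows why the near coincidence of the helium and neon levels matters so much. The sketch below evaluates the factor for a few energy mismatches; the gas temperature of 400 K and the values of ΔE are assumptions chosen for illustration and are not taken from Fig. 5.38.

```python
import math

K_B_EV = 8.617333262e-5   # Boltzmann constant, eV per kelvin

def transfer_factor(delta_e_ev, temperature_k):
    """Relative probability of resonant energy transfer, exp(-|dE| / kT)."""
    return math.exp(-abs(delta_e_ev) / (K_B_EV * temperature_k))

GAS_TEMPERATURE_K = 400.0   # assumed discharge gas temperature
for delta_e in (0.0, 0.05, 0.2, 0.5):   # assumed energy mismatches in eV
    factor = transfer_factor(delta_e, GAS_TEMPERATURE_K)
    print(f"mismatch {delta_e:4.2f} eV: relative probability {factor:.2e}")
```

A mismatch of a few hundredths of an electron volt, comparable with kT, reduces the factor only modestly, whereas a mismatch of a few tenths of an electron volt suppresses the transfer by several orders of magnitude.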

The He-Ne laser is another example of a four-level system, and as such we require that 
the population of the lasing transition terminal level be kept as low as possible. This implies 
that electrons in the terminal level should decay as rapidly as possible back to the ground 
state. In neon this is a two-step process: the first, 2p to 1s, is a rapid transition, but the second,
1s to the ground state, is not so rapid. The latter transition rate is, however, enhanced by
collisions with the walls of the discharge tube, and indeed the gain of the laser is found to 
be inversely proportional to the tube radius. For this reason, the discharge tube diameter 
should be kept as narrow as possible. 

The transition 2p to 1s is also of interest since it gives rise to the familiar colour of 'neon
lights'. Thus the 2p level itself must be populated by the discharge. This is unfortunate since 
an increase in the population of the 2p level implies a decrease in population inversion (at 
least as far as the 1.15 µm, 632.8 nm and 543.5 nm wavelengths are concerned), and in fact
this effect is to a large extent responsible for lasing ceasing at high tube currents. We cannot 
therefore increase the output simply by increasing the current indefinitely. Thus the He-Ne 
laser is destined to remain a relatively low power device. 

EXAMPLE 5.6 Efficiency of an He-Ne laser

We may estimate the efficiency of a low power He-Ne laser from the following.

A typical laser operates with a current of 10 mA at a d.c. voltage of 2500 V and gives
an optical output of 5 mW. Its overall power efficiency is then

\[
\frac{5 \times 10^{-3}}{2500 \times 10 \times 10^{-3}} = 2 \times 10^{-4} = 0.02\%
\]

Three of the four main laser transitions (at 3.39 µm, 632.8 nm and 543.5 nm) share a
common starting level. Thus the transitions are always competing with each other and pre- 
cautions must be taken to prevent the two unwanted wavelengths from lasing. This can be 
achieved quite easily by using the multilayer coated mirrors discussed in section 5.5, which 
have a wavelength-dependent reflectance. The very low absorption loss of such mirrors is 
an essential feature as the gain in the He-Ne medium is rather small; indeed, the use of such 
mirrors is quite general in gas lasers. 

The basic structure of the He-Ne laser is relatively simple. The essential elements are 
shown in Fig. 5.39. The discharge is usually initiated by a high voltage 'trigger' pulse of
some 10-20 kV, and then maintained at a current of 5 to 10 mA. The mirrors forming the 
resonant cavity are sometimes cemented to the ends of the discharge tube, thereby forming 
a gas-tight seal. Alternatively, the mirrors can be external to the tube, which is then sealed 
with glass windows orientated at the Brewster angle to the axis of the tube. This arrange- 
ment allows 100% transmission for the radiation with its electric vector vibrating parallel 
to the plane of incidence, thereby ensuring the maximum possible gain (minimum losses) 
in each round trip. The Brewster windows therefore also result in the output being plane polar- 
ized. Although this arrangement is slightly more complicated than the former one, it enables 
us to insert frequency stabilizing, mode selecting and other devices into the cavity. The 
mirrors can also be changed to allow operation with other output characteristics and at other 
wavelengths. 
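
The action of the Brewster windows can be checked with the Fresnel equations. The sketch below assumes a window of refractive index 1.5 in air (an assumed, typical value for glass) and evaluates the reflectance of a single surface for light polarized parallel (p) and perpendicular (s) to the plane of incidence.

```python
import math

def brewster_angle_deg(n_window, n_outside=1.0):
    """Angle of incidence at which the p-polarized reflectance vanishes."""
    return math.degrees(math.atan2(n_window, n_outside))

def fresnel_reflectances(theta_i_deg, n_window, n_outside=1.0):
    """Return (R_p, R_s) for one surface at the given angle of incidence."""
    ti = math.radians(theta_i_deg)
    tt = math.asin(n_outside * math.sin(ti) / n_window)   # Snell's law
    r_p = ((n_window * math.cos(ti) - n_outside * math.cos(tt))
           / (n_window * math.cos(ti) + n_outside * math.cos(tt)))
    r_s = ((n_outside * math.cos(ti) - n_window * math.cos(tt))
           / (n_outside * math.cos(ti) + n_window * math.cos(tt)))
    return r_p ** 2, r_s ** 2

N_GLASS = 1.5   # assumed refractive index of the window
theta_b = brewster_angle_deg(N_GLASS)
R_p, R_s = fresnel_reflectances(theta_b, N_GLASS)
print(f"Brewster angle         : {theta_b:.2f} degrees")
print(f"p-polarized reflectance: {R_p:.2e}")
print(f"s-polarized reflectance: {R_s:.3f}")
```

For the assumed index the Brewster angle is about 56.3°. The p-polarized light suffers essentially no reflection loss, while the s-polarized light loses about 15% at each surface, far more than the small gain of the He-Ne medium can make up, so that only the p-polarized mode oscillates.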

The power output from He-Ne lasers is rather small (up to about 100 mW maximum, 
though more typically a few milliwatts); however, the radiation is extremely useful in a wide 
range of applications because it is highly collimated, coherent and has an extremely narrow 
linewidth. 

5.10.3.2 Ion lasers 

NOBLE GAS ION LASERS 

The most powerful CW lasers operating in the visible region are the inert gas ion lasers such 
as the argon and krypton ion lasers. CW outputs of several watts can readily be obtained while, 
if the laser is pulsed, powers up to a kilowatt in microsecond pulses can be generated. 

The gas atoms are ionized by electron collision in a high current discharge (≈ 15 to 50 A).
The ions are excited by further electron collisions up to a group of energy levels (4p) some 
35 eV above the atomic ground state. As the electron energies are only a few electron volts, 
the excitation must be the result of multiple collisions. A population inversion forms between 
the 4p levels and the 4s level, which is about 33.5 eV above the ground state (Fig. 5.40), so 
that a series of stimulated lines is emitted. These range in wavelength from 351 nm to 520 nm,
although most of the energy is concentrated in the 488 nm and 514.5 nm lines. 
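
Comparing the energy of the emitted photons with the energy needed to reach the upper laser level gives an upper limit to the efficiency of the argon ion laser. The sketch below takes the upper level to lie about 35 eV above the atomic ground state, as quoted above; it is a rough bound only, since it ignores all other losses in the discharge.

```python
HC_EV_NM = 1239.841984   # h*c expressed in eV nm

def photon_energy_ev(wavelength_nm):
    """Photon energy in eV for a wavelength in nanometres."""
    return HC_EV_NM / wavelength_nm

def quantum_efficiency_limit(wavelength_nm, upper_level_ev):
    """Photon energy as a fraction of the energy of the upper laser level."""
    return photon_energy_ev(wavelength_nm) / upper_level_ev

UPPER_LEVEL_EV = 35.0   # approximate energy of the 4p levels quoted in the text
for lam in (488.0, 514.5):
    limit = quantum_efficiency_limit(lam, UPPER_LEVEL_EV)
    print(f"{lam} nm: photon energy {photon_energy_ev(lam):.2f} eV, limit {100 * limit:.1f}%")
```

Even in this idealized picture only about 7% of the energy invested in each excited ion can emerge as laser light, which goes some way towards explaining the large input powers and the cooling requirements of these lasers.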

The tube design of the argon laser is much more complicated than that of the He-Ne laser, 
principally because of the much higher energy required to pump the ionic levels and the need 
to dissipate the heat energy released. The current density can be increased by concentrat- 
ing the discharge with a magnetic field applied along the axis of the tube (the ions spiral 
about the magnetic lines of force). This has the added advantage of reducing the number of 
ions which collide with and damage the walls of the tube. The tube is made of a refractory
material such as graphite or beryllium oxide. To dissipate the large amount of heat generated,
most ion laser tubes are water cooled and often include a series of metal disks to act
as heat exchangers, as shown in Fig. 5.41. Holes in the centre of these disks define the active
region of the laser. Because of the high current involved, the cathode must be an excellent
electron emitter and a getter is often incorporated to 'clean up' any impurities which might
otherwise poison the cathode.

FIG. 5.40 Simplified energy level diagram for the argon ion laser. Ten or more laser lines are produced,
but the two shown are by far the most intense.

Again, the discharge is initiated by a high voltage pulse and then maintained by a d.c. 
voltage of about 200 V. During operation, the positive ions tend to collect at the cathode 
and may eventually cause the discharge to be extinguished. To prevent this, a gas return path 
is provided between the cathode and anode to equalize the pressure. Pulsed ion lasers tend 
to be simpler and, with a low duty cycle, the heat generated is small enough to be dissipated 
by convective cooling. 

To select any desired wavelength, a small prism is inserted into the cavity and the posi- 
tion of the end mirror is changed by rotating it to be normal to the path of the radiation with 
the desired wavelength. This ensures that radiation of this particular wavelength will be 
reflected to and fro, while that of other wavelengths will be lost from the cavity after only 
a few round trips.

Krypton ion lasers are becoming increasingly used as excitation sources for dye lasers 
(see below) and in physical and chemical investigations. They produce a wealth of spectral
lines ranging across the entire visible spectrum from about 340 nm to 800 nm, with the most
intense line at 647 nm. 



METAL VAPOUR ION LASERS 

A group of ion lasers which is becoming increasingly important is based on discharges in 
ionized metal vapours such as gold, copper and cadmium; indeed many metallic elements 
in the periodic table can be made to lase in this way. We shall briefly describe the cadmium 
(Cd) laser, which has attracted considerable commercial interest because of its ability to generate
short wavelength, CW emission at wavelengths of 441.6 nm and 325.0 nm. Other metal
vapour ion lasers are described in ref. 5.23. 

In many ways the operation of the Cd ion laser is very similar to that of the He-Ne laser 
in that the excitation process also involves helium. Excited states of the Cd⁺ ions are achieved
following collisions between cadmium atoms and energetic helium atoms in a discharge:

Cd + He* = Cd⁺* + He + e

In this case, however, Cd⁺ ions are also excited to even higher energies through collisions
of cadmium atoms with energetic helium ions:

He⁺ + Cd = Cd⁺* + He

though the resulting laser emissions are not nearly as intense as those at 325 nm and 441.6 nm.

Metal ion lasers are technically more demanding than other gas lasers as not only does
the metal have to be vaporized, but steps have to be taken to prevent the vapour from condensing
on the laser windows. One way of doing this is to use a helium plasma to separate
the windows from the main He-Cd discharge. It is also difficult to maintain a uniform distribution
of vapour ions along the length of the discharge. The flow of vapour ions towards
the cathode helps in this respect. A typical laser design is shown in Fig. 5.42. The cadmium
vapour is generated at the anode end of the discharge tube in an oven held at a temperature
of 220°C, and condensed at the cathode end. A second oven around the discharge tube maintains
a constant temperature in the active region. The vapour pressure of cadmium is typically
0.002 torr, while that of helium is 3.5 torr; a discharge current of some 100 mA can then
generate powers of some 200 mW at 441 nm, and 20 mW at a wavelength of 325 nm from
a discharge tube 1.5 m long.

FIG. 5.42 Schematic diagram of an He-Cd vapour ion laser.

5.10.3.3 Molecular lasers - the carbon dioxide laser

The carbon dioxide laser is the most important molecular laser and indeed it is arguably the 
most important of all lasers from the standpoint of technological applications. In molecu- 
lar lasers, the energy levels are provided by the quantization of the energy of vibration and 
rotation of the constituent gas molecules. The CO₂ molecule is basically an in-line arrangement
of the two oxygen atoms and the carbon atom, which can undergo three fundamental
modes of vibration as shown in Fig. 5.43. At any one time, the molecule can be vibrating
in any linear combination of these fundamental modes. The modes of vibration are denoted
by a set of three quantum numbers (v₁, v₂, v₃) which represent the amount of energy or
number of energy quanta associated with each mode. The set (100), for example, means that
a molecule in this state is vibrating in a pure symmetric mode with one quantum of vibrational
energy; it has no energy associated with the asymmetric or bending modes.

FIG. 5.43 Vibrational modes of the CO₂ molecule.

FIG. 5.44 Simplified energy level diagram for the CO₂ laser. Each vibrational level has many rotational
levels associated with it, J = 1, 2, .... The 10.6 µm line is the strongest.



In addition to these vibrational modes, the molecule can also rotate, and thus it has 
closely spaced rotational energy levels associated with each vibrational energy level. The 
rotational levels are designated by an integer J, the rotational quantum number. The energy
separation between these molecular levels is small and the laser output is therefore in the
infrared. The important parts of the CO₂ energy level arrangement are shown in Fig. 5.44,
which also shows the ground state and first excited state of the vibrational modes of 
nitrogen. 

Many CO₂ lasers contain a mixture of CO₂, nitrogen and helium in the ratio 1:4:5. Nitrogen
plays a similar role to that of helium in the He-Ne laser. Excited nitrogen molecules transfer
energy to the CO₂ molecules in resonant collisions, exciting them to the (001) levels.
The (100) CO₂ levels have a lower energy and cannot be populated in this way, so that population
inversion is created between the (001) and (100) levels giving stimulated emission
at about 10.6 µm. The helium has a dual role. First, it increases the thermal conductivity to
the walls of the tube, thereby decreasing the temperature and Doppler broadening, which
in turn increases the gain. Secondly, it increases the laser efficiency by indirectly depleting
the population of the (100) level, which is linked by resonant collisions to the (020) and (010)
levels, the latter being depleted via collisions with the helium atoms. 

While other gas lasers have efficiencies of 0.1% or less, the CO₂ laser may have an efficiency
up to about 30%. This is essentially due to the ease with which electrons in the discharge
can cause excitation and because of the strong coupling of the various levels involved.
Because of this high efficiency, it is relatively easy to obtain CW outputs of 100 W for a
laser 1 m long. Powers of this magnitude and greater (see below) mean that the mirrors should
have very low absorption, while operation in the infrared also means that special materials
must be used for windows, mirrors and other laser components. Materials which have been
used successfully include germanium, gallium arsenide, zinc sulfide, zinc selenide and
various alkali halides, although these suffer from being relatively soft and hygroscopic. In
some cases, a diffraction grating mounted on a piezoelectric transducer is used instead of
the high reflectance mirror. The grating permits tuning of the laser output over the large range
of distinct lines which Fig. 5.44 shows are possible. In the transition from the (001) to the
(100) group of levels, the selection rule ΔJ = ±1 operates. Thus, for example, for J = 10 in
the upper level, J can be 11 or 9 in the lower level. Then using the convention that J refers
to the rotational quantum number of the lower level, the corresponding transitions give rise
to the P (ΔJ = +1) and R (ΔJ = −1) branches respectively. The strongest lines are given
by the P18, P20 and P22 transitions (ref. 5.24).

CO₂ LASER CONFIGURATIONS

SEALED TUBE LASERS 

By a sealed tube laser we mean one similar to that adopted for He-Ne lasers where the discharge
gases are completely sealed within the discharge tube. The problem with this design,
as far as CO₂ lasers are concerned, is that during discharge the CO₂ molecules tend to break
down into carbon monoxide molecules, CO. This occurs at quite a high rate and dramatically
shortens the lifetime of the laser. One solution to this problem is to add hydrogen or
water vapour to the gas mixture, which react with the CO and regenerate CO₂. Despite the
presence of helium, which helps to conduct heat to the tube walls, gas cooling is another
problem and generally the total power output is not more than about 100 W, while the lifetimes
have been restricted to a few thousand hours. Sealed tube designs consequently are
not very common, but they are used in conjunction with the so-called waveguide design, in
which the inner dimensions of the tube are small (of the order of millimetres) and form a
dielectric waveguide. Excellent beam quality with relatively large output powers can be
obtained in this way from compact structures. Excitation of the gas is normally obtained by
means of an intense radio-frequency field that can penetrate the dielectric material. Recently
improvements in the discharge technology have enabled sealed tube lasers with CW outputs
of some 500 W and lifetimes in excess of 20 000 hours to be fabricated (ref. 5.25).

GAS FLOW LASERS 

Both degradation and cooling problems may be alleviated by allowing the gas mixture to 
flow through the laser tube. In the simplest designs both the gas flow and electrical discharge 
occur along the tube axis. If no attempt is made to recycle the gas, then a fresh mixture must 
be supplied continuously, though this is not a serious problem given that the gas pressures 
are relatively low. Vacuum pumps draw the gases through the discharge tube either to vent 
them to the atmosphere or for recycling. 

The power output of the CO₂ laser increases approximately linearly with the length of the
tube at a rate of about 60 W per metre. In attempts to obtain large output powers, lasers with 
tubes tens of metres in length have been built, which give powers of a few tens of kilowatts. 
Such tube lengths are generally impractical and other techniques are used to produce very 
high powers. 
TRANSVERSELY EXCITED ATMOSPHERIC (TEA) LASERS

In the CO₂ laser the output power can be increased by increasing both the gas pressure and
the tube length. The problem then is that it becomes increasingly difficult to create a discharge.
At atmospheric pressure, for example, the breakdown voltage is 1.2 kV mm⁻¹ and thus even
for a laser 1 m long we would need an unacceptably high voltage. To overcome this, the dis- 
charge is struck transversely across the tube so that the discharge path length is now about a 
centimetre. This arrangement is referred to as transverse excitation at atmospheric pressure. 
The high voltage is applied to a set of electrodes placed along the tube as shown in Fig. 5.45. 
With this arrangement, peak powers in the gigawatt range can be obtained in very short pulses 
with a pulse repetition rate of about 20 Hz. To obtain a stable discharge and submicrosecond 
pulse rise times it is necessary to pre-ionize the gas before the main discharge occurs. This 
can be achieved by using either ultraviolet radiation or electron beams. Cool gas flowing through 
the lasing region further increases the population inversion and hence the power output. 
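
The advantage of striking the discharge transversely can be put in numbers. The short sketch below simply multiplies the breakdown figure quoted above (1.2 kV per millimetre at atmospheric pressure) by the discharge path length, first for a discharge along a 1 m tube and then for one across a gap of about a centimetre. The function and variable names are our own illustrative choices.

```python
# Breakdown figure quoted in the text for atmospheric pressure (kV per mm).
BREAKDOWN_FIELD_KV_PER_MM = 1.2


def breakdown_voltage_kv(gap_mm: float) -> float:
    """Voltage (in kV) needed to break down a gas gap of the given length (in mm)."""
    return BREAKDOWN_FIELD_KV_PER_MM * gap_mm


# Discharge along the axis of a 1 m tube versus across a 1 cm transverse gap.
longitudinal_kv = breakdown_voltage_kv(1000.0)
transverse_kv = breakdown_voltage_kv(10.0)

print(f"Longitudinal discharge (1 m): {longitudinal_kv:.0f} kV")
print(f"Transverse discharge (1 cm): {transverse_kv:.0f} kV")
```

The longitudinal case calls for a voltage in the megavolt range, whereas the transverse case needs only some tens of kilovolts.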

GAS DYNAMIC LASERS 

In gas dynamic lasers the population inversion is created through the application of
thermodynamic principles rather than via a discharge. A nitrogen-carbon dioxide mixture is
heated and compressed and then allowed to expand very rapidly into a low pressure region.
During heating and compression the population of the energy states reaches the Boltzmann
distribution appropriate to the higher temperature. At high temperatures most of the energy
is stored in the vibrational modes of the nitrogen molecule. At lower temperatures, after
expansion into the low pressure region, resonant collisions of the nitrogen molecules with
the carbon dioxide molecules populate the (001) state of the CO₂ molecules and create a
population inversion. With very active pumping of this type, CW output powers in excess of
100 kW have been achieved. Gas dynamic lasers suffer from the disadvantage of large size
and the rocket-like roar as the gas expands.



OTHER MOLECULAR LASERS 

As the number of molecular systems with energy levels which may be suitable for laser action 
is very large, it is not surprising that there are other molecular lasers and it is likely that many 
more will be discovered. Two such molecular lasers emitting in the ultraviolet-visible region
have been developed, namely nitrogen and excimer lasers.

The nitrogen laser differs markedly from the CO₂ laser. In the latter the transitions are
between molecular rotational/vibrational energy levels, while in nitrogen the laser transi- 
tions are between electronic energy states and give rise to an output in the ultraviolet at 
337 nm. A requirement for CW operation is that the upper level of the lasing transition should 
have a long lifetime while the lower level should be rapidly depopulated. In nitrogen, 
however, the converse is true: the upper level lifetime is exceedingly short (of the order of 
nanoseconds) while the lower level lifetime is of the order of microseconds. Hence popu- 
lation inversion and lasing action can only be maintained for a few nanoseconds. Very fast 
rise time pumping mechanisms similar to those used for TEA lasers must be used. 

The gain in nitrogen is so large that it can be used as a simple amplifier; that is, in many
applications it is not necessary to provide feedback (such high gain is termed superradiant)
though a high reflecting back mirror is used to collect the amplified light into a single output
beam. Commercial nitrogen lasers are capable of producing 100 kW peak power pulses. They
are often used in photochemical investigations and for pumping other lasers, for example 
dye lasers (section 5.10.4). 

In contrast to nitrogen, excimers provide a metastable excited state. An excimer (or excited
dimer) refers to a molecule formed by the association of one excited atom (or molecule) with
another atom (or molecule) which is in the ground state. If both constituents are in the ground
state, then at interatomic distances characteristic of molecules they repel each other.
Consequently, the excimer readily dissociates thereby effectively reducing the population
of the lower lasing level and increasing the ease with which population inversion is established.
If one or both of the constituents of the excimer are rare gas atoms, the excitation
energy is extremely large and the metastable excimer state is an important system for storing
energy.

Since 1972, a large number of lasers based on excimers have been developed; these cover 
the wavelength range from about 120 nm to 500 nm. Rare gas halide excimer lasers are espe- 
cially efficient with XeF and KrF giving the highest efficiencies (some 10-15%). 

Excimer lasers are usually pumped by an intense electron beam source or by a fast dis- 
charge. The electrons in the electron beam are accelerated until they have energies of 1 MeV 
and then transmitted to the laser chamber in pulses giving beam currents of about 100 kA. 
Electron beam generators with this sort of capability are rather large and it is more conve- 
nient to pump high pressure mixtures of the rare gas and halogen by collisional reactions in 
a fast pulsed discharge using a TEA configuration. Such lasers can generate some 100 W 
of average quasi-CW power in 30 ns pulses at rates of 1 kHz.
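
It is instructive to see what these figures imply for a single pulse. The sketch below takes the values quoted above (100 W average power, 30 ns pulses, 1 kHz repetition rate) and estimates the energy per pulse and the peak power. It assumes, purely for simplicity, that each pulse is rectangular in time; real pulse shapes will change the peak value somewhat.

```python
# Figures quoted in the text for a discharge-pumped excimer laser.
average_power_w = 100.0      # average output power
pulse_duration_s = 30e-9     # pulse length
repetition_rate_hz = 1.0e3   # pulses per second

# Energy delivered in each pulse.
pulse_energy_j = average_power_w / repetition_rate_hz

# ASSUMPTION: rectangular pulse, so peak power = energy / duration.
peak_power_w = pulse_energy_j / pulse_duration_s

# Fraction of the time during which the laser is actually emitting.
duty_cycle = pulse_duration_s * repetition_rate_hz

print(f"Energy per pulse: {pulse_energy_j:.2f} J")
print(f"Peak power: {peak_power_w / 1e6:.1f} MW")
print(f"Duty cycle: {duty_cycle:.1e}")
```

On this estimate each pulse carries about 0.1 J and the peak power is of the order of a few megawatts, although the laser emits for only a few parts in 10⁵ of the time.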

5.10.4 Liquid dye lasers

Liquids have useful advantages in relation to both solid and gas laser media. Solids are very
difficult to prepare with the requisite degree of optical homogeneity and they may suffer
permanent damage if overheated. Gases do not suffer from these difficulties but have a much
smaller density of active atoms. Several different liquid lasers have been developed, but the
most important is the dye laser (ref. 5.26). It has the advantage that it can be tuned over a
significant wavelength range. This is extremely useful in many applications such as
spectroscopy and the study of chemical reactions.

FIG. 5.46 Absorption and emission (fluorescence) spectra of a typical dye laser.

The active medium is an organic dye dissolved in a solvent. When the dye is excited by
short wavelength light it emits radiation at a longer wavelength, that is it fluoresces. The
energy difference between the absorbed and emitted photons ultimately appears as heat;
typical absorption and emission spectra are shown in Fig. 5.46. The broad fluorescence
spectrum can be explained by the energy level diagram of a typical dye molecule. As Fig. 5.47
shows, the molecule has two groups of closely spaced electronic energy levels: the singlet
states (S₀, S₁ and S₂) and the triplet states (T₁ and T₂). The singlet states occur when the total
spin of the excited electrons in each molecule is zero (the value of 2S + 1 is thus unity). The
triplet states occur when the total spin is unity (2S + 1 = 3). Each electronic energy level is
broadened into a near continuum of levels by the effects of the vibration and rotation of the
dye molecule and also by the effects of the solvent molecules. Pumping results in the
excitation of the molecule from the ground state S₀ to the first excited state S₁. This is followed
by very rapid non-radiative decay processes to the lower of the energy levels in S₁. The laser
transition is then from these levels to a level in S₀. Since there are many such
rotational/vibrational levels within S₀ and S₁, there are many transitions resulting in an emission
line which is very broad. As the termination of the laser transition in S₀ is at an energy much
larger than kT above the bottom of S₀, the dye laser is a four-level system and threshold is
reached with a very small population inversion.

FIG. 5.47 Energy level scheme for a dye molecule. The laser transitions terminate above the lowest
energy in the S₀ singlet state, so the laser is a four-level system. S₁→T₁ transitions lead to strong absorption
at the laser wavelengths, thereby quenching laser action. S₁→S₂ transitions may also quench laser action in
some dyes.

Although the triplet states are not directly involved in the laser action, they have a profound
effect as there is a small probability of a transition S₁→T₁, even though this is forbidden
by quantum mechanical selection rules. Since the transition T₁→S₀ is also forbidden,
molecules 'pile up' in T₁. The transition T₁→T₂ is allowed, however, and unfortunately the
range of frequencies required for this transition is almost exactly the same as the laser
transition frequencies. Thus, once a significant number of molecules have made the S₁→T₁
transition, T₁→T₂ absorption reduces the gain and may stop the lasing action. For this reason,
most dye lasers operate in short pulses (shorter, in fact, than the time taken for T₁ to acquire
a significant population, which is typically 1 µs). For long pulse or CW operation, the
population in T₁ will build up to equilibrium values, in which case the absorption is high and
becomes the ultimate limitation on the efficiency of the laser.

Many dyes have been used as laser media, and Fig. 5.48 shows that, by tuning, laser
wavelengths covering the whole of the visible spectrum can be obtained. The dye called rhodamine
6G with methanol as a solvent is one of the most successful, having an efficiency of about
20% and a broad tuning range (570-660 nm).

All dye lasers are optically pumped, the pumping source having a wavelength slightly
less than that of the laser output. Commercial pumping methods include flashtubes,
nitrogen lasers, solid state lasers and ion (Ar⁺ or Kr⁺) lasers. As the pumping radiation is in the
visible or ultraviolet part of the spectrum, we must use harmonics of the output from
neodymium lasers. The choice of the pump source depends on the absorption spectrum
of the dye being used and the type of output desired. For CW output, the usual commercial
pump source is an ion laser; all the other pumping sources operate in the pulsed mode
and produce a pulsed output from the dye laser. As the population of T₁ builds up in about
1 µs, the flashtubes must discharge in about 1 µs (in contrast to the millisecond time of
tubes used with solid lasers). For this reason, specially strengthened flashtubes must be
used.

FIG. 5.48 Relative outputs of some common laser dyes pumped by ion lasers. Rhodamine 6G (R6G), for
example, is here pumped by 5 W of power from all the argon lines. Coumarin dyes are labelled C; C490 is
pumped by 2.3 W at 488 nm.

FIG. 5.49 Schematic diagram of a tunable CW laminar flow dye laser. The dye stream flow is
perpendicular to the page.

To avoid absorption losses due to a build-up of molecules in triplet states and thereby 
obtain a CW output, the dye is passed in a liquid jet through the pumping radiation. The dye 
emerges from a specially shaped nozzle in the form of a smooth laminar sheet; the small 
signal gain is so high that this very small thickness of the active medium is sufficient to give 
laser action. The dye flows at a rate of about 10 m s⁻¹ so that the molecules spend a very
short time in the cavity, which is less than the lifetime of the triplet states. 
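
To get a feel for the times involved, the sketch below estimates how long a dye molecule spends in the pumped region of the jet. The flow speed of 10 m s⁻¹ is the figure quoted above; the width of the pumped region is not given in the text, so the 10 µm spot used here is purely an illustrative assumption, as are the function and variable names.

```python
# Flow speed of the dye jet quoted in the text (m/s).
JET_SPEED_M_PER_S = 10.0


def transit_time_s(spot_diameter_m: float,
                   speed_m_per_s: float = JET_SPEED_M_PER_S) -> float:
    """Time a dye molecule takes to cross a pumped region of the given width."""
    return spot_diameter_m / speed_m_per_s


# ASSUMPTION: focused pump spot about 10 micrometres across (illustrative only).
assumed_spot_m = 10e-6
transit = transit_time_s(assumed_spot_m)

print(f"Transit time through the pumped region: {transit * 1e6:.1f} us")
```

For this assumed spot size the transit time comes out at about 1 µs, so that fresh dye is continually swept into the pumped region on roughly the same timescale as that on which the triplet population builds up. A tighter focus or a faster jet shortens the transit time in direct proportion.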

The laser output can be tuned using, for example, a prism, wedge filter or a diffraction 
grating, which can serve as a combined end mirror and dispersing element. A typical dye 
laser arrangement, which might generate an output of 1 W from a 3 W argon ion pump source, 
is shown in Fig. 5.49. 



5.10.5 Parametric lasers 

Although the principles of optical parametric oscillators (OPOs), which were described 
briefly in section 3.9, have been well understood for many years, it is only relatively recently 
that a range of efficient devices has become available commercially (ref. 5.27). The
commercial viability of OPOs has resulted from two factors. First, dramatic improvements have
been made in the quality of non-linear crystals such as BBO (beta-barium borate), KTP
(potassium titanyl phosphate), KNB (potassium niobate) and LBO (lithium triborate).
Secondly, there have been comparable improvements in the beam quality, stability and
reliability of appropriate pump lasers such as Q-switched Nd:YAG lasers (sections 5.10.1 and
6.4) and argon ion lasers (section 5.10.3.2). Consequently while conversion efficiencies of
only 1% or so were achieved by Giordmaine and Miller, much higher efficiencies are now
common. Though only pulsed devices operating at pulse repetition rates of a few tens of hertz
are currently readily available, OPOs operating at very much higher repetition rates, and in
CW mode, have been demonstrated.

As mentioned in section 3.10 an OPO in its simplest form consists of the non-linear crystal 
placed between two mirrors forming an optical cavity, which resonates at either the signal 
or idler frequencies. As the power of the pump laser wave is increased a threshold pumping
power will be reached, at which the parametric gain balances the losses, and the signal or 
idler waves will oscillate within the cavity to produce a coherent output. The practical impor- 
tance of the OPO is its ability to convert the pump power into coherent waves which can 
be tuned continuously over large frequency ranges, in contrast to most of the other lasers 
described in this chapter. 

Tuning of the idler and signal waves can be achieved in a number of ways as the refractive
indices of the non-linear crystals depend on frequency, crystal orientation (if one or other
of the waves is an extraordinary one), temperature and electric field. Giordmaine and Miller
used temperature variation of the lithium niobate crystal to control the frequency of the
oscillations as indicated in Fig. 5.50. A temperature change of about 11°C produces output
frequencies in the range 3.1 × 10¹⁴ to 2.6 × 10¹⁴ Hz (corresponding to a wavelength range of
968-1154 nm).
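
The correspondence between the quoted frequency and wavelength ranges follows from λ = c/ν. The sketch below carries out the conversion for the two limiting frequencies; the function name is our own, and the value of c is the standard defined one.

```python
# Speed of light in vacuum (m/s), exact by definition.
C_M_PER_S = 299_792_458.0


def wavelength_nm(frequency_hz: float) -> float:
    """Vacuum wavelength (in nm) corresponding to an optical frequency (in Hz)."""
    return C_M_PER_S / frequency_hz * 1e9


# Limits of the temperature-tuned output range quoted in the text.
short_limit_nm = wavelength_nm(3.1e14)
long_limit_nm = wavelength_nm(2.6e14)

print(f"3.1e14 Hz -> {short_limit_nm:.0f} nm")
print(f"2.6e14 Hz -> {long_limit_nm:.0f} nm")
```

The results, roughly 967 nm and 1153 nm, agree with the quoted range to within the rounding of the two-figure frequencies.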

Alternatively if we remember that in general k = 2π/λ = ωn/c, we can write eq. (3.29) in
the form

$$\omega_3 n_3 = \omega_1 n_1 + \omega_2 n_2 \qquad (5.43)$$

Hence if the pump waves correspond to extraordinary waves, then changes in the beam
direction of the pump relative to the crystal's optic axis will change n₃, and hence ω₁ and ω₂ (the
signal and idler waves are usually chosen to be ordinary waves so that n₁ = n₂ = n_o). A
change in angle of some 8° can give rise to a tuning range extending from about 400 nm to
700 nm.

Commercially available OPOs cover the spectral range from about 400 nm to 4.0 µm
though oscillations from 300 nm to 18 µm have been demonstrated. Although there are many
actual and potential applications for a continuously tunable laser source, such as photo- 
chemistry, including the study of ultrafast chemical reactions (refs 6.4d and 6.8), high res- 
olution spectroscopy and biomedicine, considerable improvements in the beam quality are 
required, and the linewidths need to be reduced. In addition, present average power levels 
are below those of other tunable laser sources, and parametric processes are difficult to exploit 
under CW conditions because of the low gain available when the non-linear crystal is pumped 
with CW radiation. 

5.10.6 The free electron laser 

We conclude this chapter with a brief description of a laser which has enormous potential 
in terms of its possible applications; these arise from its tunability over a wide wavelength 
range (from the extreme ultraviolet to millimetre wavelengths), with high power and high 
efficiency. Although the operating principles have been verified, the device is still at the 
developmental stage, where the main aims are to reduce its size and cost. 

The operation of the free electron laser (FEL) is quite different from that of any of the lasers
considered so far (ref. 5.28). The basic source of energy is a beam of relativistic electrons 
(i.e. electrons travelling at a speed very close to that of light), which is generated by an elec- 
tron gun (which is, in fact, a particle accelerator). Under certain circumstances the electrons 
can be induced to give up some of their energy to a beam of photons travelling in the same 
direction, thereby amplifying the beam. The photon beam is from another laser, such as a 
CO₂ pulsed laser.

Now as the electric field of the light beam is at right angles to its direction of travel, the 
photons cannot receive any energy from the electrons unless they too have a component of 
velocity perpendicular to the direction of travel. Hence the electron beam is given an oscil- 
latory path by passing it through a so-called 'wiggler' magnet, which consists of a row of 
alternating NS/SN magnets as shown in Fig. 5.51. This generates a spatially periodic mag- 
netic field so that the electrons undergo the transverse oscillations indicated in the diagram. 
As a result of these oscillations the electrons emit monochromatic radiation in the forward 
direction at a wavelength which depends on their velocity, the magnet spacing and the value 
of the magnetic field. In fact the wavelength peaks at a value given by 
where λ_w is the spatial period of the wiggler, and γ is the ratio of the total electron energy
to its rest energy, which is 0.511 MeV (ref. 5.28). Such radiation is termed magnetic
bremsstrahlung. The bandwidth, Δλ, of the radiation is given by



$$\frac{\Delta\lambda}{\lambda} = \frac{1}{2N} \qquad (5.45)$$



where N is the number of wiggler periods. 
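
A rough numerical illustration may help to show why the FEL is so widely tunable. The sketch below assumes the commonly quoted weak-field approximation for the peak wavelength, λ ≈ λ_w/(2γ²), which involves only the two quantities defined above; this form is our assumption and neglects any dependence on the strength of the magnetic field. The fractional bandwidth is taken as Δλ/λ = 1/(2N). The wiggler period, electron energy and number of periods used are hypothetical values chosen only for illustration.

```python
# Electron rest energy quoted in the text (MeV).
ELECTRON_REST_ENERGY_MEV = 0.511


def lorentz_gamma(total_energy_mev: float) -> float:
    """Ratio of the total electron energy to its rest energy."""
    return total_energy_mev / ELECTRON_REST_ENERGY_MEV


def fel_wavelength_m(wiggler_period_m: float, total_energy_mev: float) -> float:
    """Peak emission wavelength.

    ASSUMPTION: weak-field approximation, lambda = lambda_w / (2 * gamma**2).
    """
    gamma = lorentz_gamma(total_energy_mev)
    return wiggler_period_m / (2.0 * gamma ** 2)


def fractional_bandwidth(n_periods: int) -> float:
    """Fractional bandwidth, delta_lambda / lambda = 1 / (2N)."""
    return 1.0 / (2.0 * n_periods)


# Hypothetical parameters: 3 cm wiggler period, 50 MeV electrons, 100 periods.
peak_wavelength_m = fel_wavelength_m(0.03, 50.0)
bandwidth = fractional_bandwidth(100)

print(f"gamma = {lorentz_gamma(50.0):.1f}")
print(f"Peak wavelength = {peak_wavelength_m * 1e6:.2f} um")
print(f"Fractional bandwidth = {bandwidth:.3f}")
```

Because the wavelength varies as 1/γ², doubling the electron energy in this approximation reduces the output wavelength by a factor of four, which is the origin of the very wide tuning range.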

Although one might expect that the radiation from the electron would now simply inter- 
act with the original laser beam and amplify it, this does not occur as readily as might at 
first appear. If the electron beam is 'tuned' to emit at exactly the same wavelength as the 
original light beam then in fact the energy exchanges between the two beams are equal and 
opposite. 

One way of solving this problem is to inject the electrons at an energy slightly higher than 
that corresponding to the resonant wavelength (or 'resonant energy'); the electrons then tend 
to be 'pulled back' towards the resonant condition. Obviously only a small amount of energy, 
that is the difference between the initial and final electron energies, can be transferred in 
this way. Alternatively the wiggler period may be varied so that the final resonant energy 
is less than that at the start. The electrons then tend to 'home-in' to the resonant transfer con- 
dition and in so doing give up energy to the light beam. To make the FEL more efficient 
requires a careful reuse of the electron energy once the electron beam has traversed the
interaction region; although various approaches have been suggested to accomplish this, none 
is yet very successful. 



5.11 Conclusion

We have by no means covered the entire range of lasers currently available, nor fully dis- 
cussed the various modifications and refinements of the lasers which we have described in 
this chapter. We believe, however, that the basic laser physics covered, together with the
descriptions of some typical lasers from the various 'classes' of laser, will enable the reader 
to understand the mode of operation of other lasers which might be encountered, or which 
undoubtedly will be developed in the future. 



NOTES 



In general, the probability of occupation of all the available states or energy levels is not
the same. The states have a probability g of occupation (g is often called the degeneracy).
Frequently, the ratio g₁/g₂ in eq. (5.5), for example, is of the order of unity and hence
is often omitted.

The unit of energy most frequently used in this text in the energy level diagrams is the
electron volt (eV). Many texts, however, use energy units expressed as cm⁻¹, a term often
used by spectroscopists; 1 eV = 8065 cm⁻¹.
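
The conversion factor between the two units follows from E = hc/λ. The sketch below evaluates it from the defined values of h, c and e, and, as an example, expresses the photon energy of the 10.6 µm CO₂ laser line in both units. The function names are our own.

```python
# Defined SI values of the fundamental constants.
PLANCK_J_S = 6.62607015e-34         # Planck constant (J s)
C_M_PER_S = 2.99792458e8            # speed of light (m/s)
ELECTRON_CHARGE_C = 1.602176634e-19  # elementary charge (C)


def ev_to_wavenumber_cm(energy_ev: float) -> float:
    """Convert an energy in eV to a wavenumber in cm^-1."""
    per_metre = energy_ev * ELECTRON_CHARGE_C / (PLANCK_J_S * C_M_PER_S)
    return per_metre / 100.0


def wavelength_um_to_ev(wavelength_um: float) -> float:
    """Photon energy in eV for a vacuum wavelength given in micrometres."""
    energy_j = PLANCK_J_S * C_M_PER_S / (wavelength_um * 1e-6)
    return energy_j / ELECTRON_CHARGE_C


one_ev_in_cm = ev_to_wavenumber_cm(1.0)
co2_photon_ev = wavelength_um_to_ev(10.6)
co2_photon_cm = ev_to_wavenumber_cm(co2_photon_ev)

print(f"1 eV = {one_ev_in_cm:.0f} cm^-1")
print(f"10.6 um photon = {co2_photon_ev:.3f} eV = {co2_photon_cm:.0f} cm^-1")
```

The 10.6 µm photon corresponds to an energy of only about 0.117 eV, or about 943 cm⁻¹, which illustrates how small the vibrational level spacings are on the electron volt scale.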



PROBLEMS



5.1 At what temperature are the rates of spontaneous and stimulated emission equal (take
λ₀ = 500 nm)? At what wavelength are they equal at room temperature (T = 300 K)?

5.2 If 1% of the light incident into a medium is absorbed per millimetre, what fraction is
transmitted if the medium is 0.1 m long? Calculate the absorption coefficient α.

5.3 If the irradiance of light doubles after passing once through a laser amplifier 0.5 m long,
calculate the small signal gain coefficient k assuming no losses. If the increase in
irradiance were only 5%, what would k be?

5.4 An atom has two energy levels with a transition wavelength of 694.3 nm. Assuming
that all the atoms in an assembly are in one or other of these levels, calculate the
percentage of the atoms in the upper level at room temperature (T = 300 K) and at T = 500 K.

5.5 Calculate the degree of population inversion required to give a small signal gain
coefficient for a CO₂ laser (λ = 10.6 µm) of 0.5 m⁻¹. Take the Einstein A coefficient for the
upper laser level to be 200 s⁻¹.

5.6 Calculate the Doppler-broadened linewidth for the CO₂ laser transition (λ₀ = 10.6 µm)
and the He-Ne laser transition (λ₀ = 632.8 nm) assuming a gas discharge temperature
of about 400 K. Take the relative atomic masses of carbon, oxygen and neon to be 12,
16 and 20.2 respectively.

5.7 Calculate the mirror reflectances required to sustain laser oscillations in a laser which
is 0.1 m long, given that the small signal gain coefficient is 1 m⁻¹ (assume the mirrors
have the same values of reflectance).

5.8 Calculate the threshold small signal gain coefficient for ruby given the following
data: threshold population inversion 5 × 10²² m⁻³, refractive index 1.5, linewidth
2 × 10¹¹ Hz, Einstein A coefficient 300 s⁻¹ and wavelength 694.3 nm.

5.9 What is the mode number nearest the line centre of the 632.8 nm transition of the
He-Ne laser; what is the mode separation; how many longitudinal modes could
possibly oscillate given that the width of the gain curve is 1.5 × 10⁹ Hz? Take the mirror
separation to be 0.5 m.

5.10 By considering the rate equations for a three-level laser, derive an expression for the
threshold population inversion in terms of the laser parameters. Hence compare the
pump power required to bring a three-level and a four-level laser to threshold.

5.11 Calculate the threshold pumping power for an Nd:glass laser, given that the critical
population inversion is 9 × 10²¹ m⁻³, the spontaneous lifetime is 300 µs and that the
upper laser level is at an energy of 1.4 eV.

5.12 Assuming that the exciting lamp is 10% efficient, that 10% of the light produced
actually falls on the crystal, which has a diameter of 2 mm and is 0.1 m long, and that 5%
of the exciting light energy falls within useful absorption bands, estimate the power
to be supplied to the lamp used to pump the laser in Problem 5.11.

5.13 Estimate the efficiency of a GaAs laser operating well above threshold, given that
n = 3.6 and that the length of the laser cavity is 200 µm. Take the loss coefficient to
be 800 m⁻¹ and the internal quantum efficiency to be 0.8.
Lasers II



In the previous chapter, we saw that the output of lasers does not always consist of a beam 
of very coherent, almost monochromatic radiation. The output, for example, may be contin- 
uous or in the form of mutually incoherent spikes within a pulse and consist of several lon- 
gitudinal and transverse modes of slightly different wavelength. In considering the applications 
of lasers we often find that such characteristics are quite acceptable, but equally it is often 
desirable to modify the laser output to suit a particular application. Some modifications to 
the output can be achieved quite simply: for instance, we can select one of the many wave- 
lengths produced by the argon ion laser by introducing a prism or grating into the optical cavity. 
The prism or grating disperses the light so that after transmission only one wavelength falls 
normally onto the end mirror and is reflected back into the cavity. Other modifications of the 
output, though often quite readily achieved in practice, require a clear understanding of the 
concepts of modes, population inversion, threshold gain and the like. Before discussing a small 
selection of the many and varied applications of lasers, we first consider some of the ways 
in which the laser output may be modified to facilitate these and countless other applications. 



6.1 Single mode operation

In many applications including chemical and physical investigations it is desirable to have 
the greatest possible spectral purity. We can achieve this by operating a CW laser in a single 
longitudinal, single transverse mode. Since an inhomogeneously broadened laser (see section 
5.9 and below) can support several longitudinal and transverse modes simultaneously, single 
mode operation can be achieved only by arranging for one mode to have a higher gain than 
all the others. We can ensure that the cavity will support a single transverse mode only, the
TEM₀₀ mode, by placing an aperture within the cavity. As the higher order TEM modes
spread out further than the TEM₀₀ mode, an aperture of suitable diameter will transmit the
TEM₀₀ mode while eliminating the others. All but one of the longitudinal modes can then
be rejected by reducing the length L of the laser cavity until the frequency separation between
adjacent modes, that is δν = c/2L (see section 5.9), is greater than the linewidth of the laser
transition. Figure 5.11 then shows that the single mode which falls within the transition
linewidth is the only one that can oscillate.
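
The condition δν = c/2L greater than the transition linewidth sets an upper limit on the cavity length. The sketch below evaluates the mode spacing for a given length and the longest cavity for which only one longitudinal mode falls within a given linewidth. As an illustration we take a linewidth of 1.5 × 10⁹ Hz, the He-Ne gain curve width quoted in Problem 5.9; the function names are our own and the refractive index of the gas is taken as unity.

```python
# Speed of light in vacuum (m/s); refractive index of the gas assumed to be 1.
C_M_PER_S = 2.998e8


def mode_spacing_hz(cavity_length_m: float) -> float:
    """Frequency separation between adjacent longitudinal modes, c / 2L."""
    return C_M_PER_S / (2.0 * cavity_length_m)


def max_single_mode_length_m(linewidth_hz: float) -> float:
    """Longest cavity for which the mode spacing still exceeds the linewidth."""
    return C_M_PER_S / (2.0 * linewidth_hz)


# Illustrative values: a 0.5 m cavity and a 1.5 GHz wide gain curve.
spacing_half_metre = mode_spacing_hz(0.5)
longest_cavity = max_single_mode_length_m(1.5e9)

print(f"Mode spacing for L = 0.5 m: {spacing_half_metre / 1e6:.0f} MHz")
print(f"Longest single-mode cavity: {longest_cavity * 100:.1f} cm")
```

For these figures the cavity would have to be shorter than about 10 cm, which makes clear why the available active length, and hence the output power, is so restricted.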

The disadvantage of this system is that the active length of the laser cavity may become 
so small as to limit the power output severely. This may be overcome using other techniques 
involving, for example, a Fabry-Perot resonator either inside or outside the laser cavity, 
third-mirror techniques or absorbers within the cavity (ref. 6.1). It should be stressed that
to maintain the wavelength of the single mode output at a constant value, we must stabilize
the cavity dimensions by rigid construction and temperature control or by introducing
compensating systems. If this is not done, L will change and the frequency and power of
the laser will change as a consequence. In passing, we note that it is possible to stabilize 
the operating frequency of a laser to better than 1 MHz, or about 1 part in 10⁸. Ways of doing
this are described in the next section. 



6.2 Frequency stabilization



In lasers with homogeneously broadened transitions, an increase in pumping cannot increase
the population inversion beyond the threshold value where the gain per pass equals the losses.
This is because the spectral lineshape function $g(\nu)$ describes the response of each individual
atom, all of which are considered to behave identically. Thus, as the pumping is increased
from below the threshold value the laser will begin to oscillate at the centre frequency $\nu_0$.
The gain at other frequencies will remain below threshold, however, so that an ideal
homogeneously broadened laser will oscillate only at a single frequency.

In inhomogeneously broadened lasers, on the other hand, where individual atoms are
considered to behave differently from one another, the population inversion and gain profile
can increase above the threshold values at frequencies other than $\nu_0$. The gain at $\nu_0$, however,
remains clamped at the threshold value owing to gain saturation (section 5.6). Further
pumping may increase the gain at other frequencies until oscillations commence at those
frequencies also. This results in decreases in both the population inversion and the gain to
their threshold values. The gain curve therefore acquires depressions or 'holes' in it at these
oscillating frequencies - this is referred to as hole burning. The gain curves for homogeneous
and inhomogeneous atomic systems are illustrated in Fig. 6.1, where the curves labelled
A, B and C correspond to pumping levels respectively below threshold, at threshold and
above threshold.



FIG. 6.1 Gain curves (gain against frequency $\nu$, with the threshold gain marked) for (a) a
homogeneously broadened atomic system and (b) an inhomogeneously broadened atomic system
(A, below threshold; B, at threshold; C, above threshold). In (b) different groups of atoms
respond to the stimulating radiation at different cavity mode frequencies. The gain is saturated
at each of these frequencies independently, creating 'holes' in the gain curve.

Let us now consider gas lasers, which are usually inhomogeneously broadened owing primarily
to the Doppler effect, as explained in section 5.7. We suppose that a single mode is
oscillating at frequency $\nu_m$, which is greater than the natural emission frequency of the atoms,
$\nu_0$. The oscillation, being a standing wave within the cavity, consists of two sets of waves
travelling in opposite directions, say the positive and negative $x$ directions respectively. Both
of these waves have the frequency $\nu_m$.

The interaction of the waves travelling in the positive $x$ direction with the atoms in the
laser medium will be greatest for those atoms that have a velocity component in the $x$
direction of $+v_x$ such that

$$\nu_m = \nu_0\left(1 + \frac{v_x}{c}\right) \tag{6.1}$$

For this group of atoms the Doppler shift brings the waves of frequency $\nu_m$ into resonance
with the atomic transition at $\nu_0$, and the atoms are stimulated to emit. The argument also
holds for a second group of atoms, with velocity component $-v_x$, and the waves moving in
the negative $x$ direction. There are therefore two groups of atoms whose stimulated emission
contributes to the laser output intensity; the population inversion is reduced for these
atoms and gain saturation occurs.

We have plotted the population inversion $N$ as a function of the $x$ component of velocity
in Fig. 6.2(a), where we see that stimulated emission produces a saturation in the excited state
atomic velocity distribution similar to the hole burning in the gain curve. Two 'holes' are
burned; these are symmetrically placed about $v_x = 0$ and correspond to atoms with velocities
of plus and minus $v_x$.
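
The positions of the two holes follow from rearranging the Doppler condition $\nu_m = \nu_0(1 + v_x/c)$ for $v_x$. The sketch below does this for an assumed transition frequency of $4.74 \times 10^{14}$ Hz (roughly the 632.8 nm He-Ne line) and an assumed detuning of 300 MHz; both numbers are illustrative and are not taken from the text.

```python
C = 2.998e8  # speed of light in vacuum, m/s


def resonant_velocities(nu_mode_hz, nu_atom_hz):
    """Velocity components (m/s) of the two atom groups resonant with the mode.

    Rearranges nu_m = nu_0 * (1 + v_x / c) for the wave travelling in +x;
    the wave travelling in -x interacts with atoms of the opposite velocity.
    """
    v_x = C * (nu_mode_hz - nu_atom_hz) / nu_atom_hz
    return v_x, -v_x


# Assumed values: transition at 4.74e14 Hz, mode detuned 300 MHz above line centre.
nu_0 = 4.74e14
v_plus, v_minus = resonant_velocities(nu_0 + 300e6, nu_0)
print(f"holes at v_x = {v_plus:+.0f} m/s and {v_minus:+.0f} m/s")
```

When the detuning is zero both values collapse to $v_x = 0$, so the two holes merge into one.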

Suppose that now the frequency of the oscillating mode is changed until it equals the peak
frequency of the laser line, that is $\nu_m = \nu_0$. This may be accomplished, for example, by varying
the temperature to change the cavity length slightly. Under these circumstances, only a single
group of atoms can contribute to the lasing process, namely those with zero $x$ component
of velocity, and there is a single 'hole' in the population inversion-velocity curve as shown
in Fig. 6.2(b). When this happens, the laser output power drops as the available inverted
population is smaller than before. A plot of output power as a function of frequency $\nu$, as
in Fig. 6.3, then shows a dip, the Lamb dip, at the centre frequency $\nu_m = \nu_0$.

FIG. 6.2 Distribution of the population inversion $N$ as a function of the $x$ component of velocity of the
atoms $v_x$. In (a) the cavity frequency $\nu_m$ is different from the natural emission frequency of the atoms $\nu_0$ and
two 'holes' are created in the distribution. In (b) the cavity frequency equals the atomic resonant
frequency, i.e. $\nu_m = \nu_0$, and only one hole is created, corresponding to $v_x = 0$.

The increase in power resulting from any slight deviation from the centre of the laser line
can be used as the basis of a feedback system to stabilize the frequency of the laser at the line
centre by minimizing the output. Such techniques enable the frequency to be stabilized to better
than 1 part in $10^{9}$. The long coherence length makes the output of lasers stabilized in this way
useful in applications such as long path difference interference measurements (section 6.6.1).

Another technique for stabilizing gas lasers relies on the fact that the gain profile is
symmetrical about its midpoint. Consider the situation when only two modes are operating with
equal irradiances. They must be equally spaced in frequency either side of the gain profile
maximum. Any drift in the mode frequency will cause one mode to increase in irradiance
and the other to decrease. Thus, if we are able to monitor the two mode irradiances and then
use the difference to operate a feedback loop that controls the cavity length, we should be
able to stabilize the operating frequency. At first sight, the measurement of the mode
irradiances might seem difficult. Fortunately it is usually found that adjacent cavity modes are
plane polarized with their planes of polarization at right angles to each other. Thus we need
only split the laser output into two, insert suitably orientated pieces of polaroid into the two
beams, and then allow them to fall onto two detectors. The output of the detectors is then
proportional to the irradiances of the modes. A simple way of applying feedback is to let
any difference in output of the detectors modulate the current passing through a heating coil
wrapped round the laser tube. Any changes in relative mode irradiance will then alter the
tube temperature and hence the effective length of the cavity. This in turn will vary the mode
frequencies (ref. 6.2).

FIG. 6.3 Output power as a function of frequency of a single mode laser. The dip at the centre of the
curve ($\nu = \nu_0$) is known as the Lamb dip.
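
The two-mode balancing scheme can be sketched as a simple discrete feedback loop. The model below is only a toy under stated assumptions: the gain profile is taken to be Gaussian, each mode irradiance is taken to be proportional to the gain at the mode frequency, and the heater is represented by a correction proportional to the detector difference. The profile width, mode spacing and loop gain are all hypothetical values.

```python
import math

LINE_HALF_WIDTH = 1.0  # GHz, assumed 1/e half-width of the (Gaussian) gain profile
MODE_SPACING = 0.5     # GHz, assumed intermode spacing c / 2L


def mode_irradiances(offset):
    """Irradiances of the two modes when their midpoint lies `offset` GHz above line centre."""
    gain = lambda f: math.exp(-(f / LINE_HALF_WIDTH) ** 2)
    lower = offset - MODE_SPACING / 2
    upper = offset + MODE_SPACING / 2
    return gain(lower), gain(upper)


def stabilise(offset, loop_gain=0.5, steps=60):
    """Repeatedly correct the mode positions using the detector difference as error signal."""
    history = [offset]
    for _ in range(steps):
        i_lower, i_upper = mode_irradiances(offset)
        error = i_upper - i_lower      # negative when the modes have drifted upwards
        offset += loop_gain * error    # stands in for the heater changing the cavity length
        history.append(offset)
    return history


history = stabilise(offset=0.1)
print(f"initial offset {history[0]:.3f} GHz, final offset {history[-1]:.2e} GHz")
```

Because the profile is symmetrical, the error signal changes sign with the direction of drift, which is what allows the loop to pull the modes back to equal irradiance.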

As might be expected, the techniques used to frequency-stabilize semiconductor lasers 
are somewhat different from those used in gas lasers. Obviously, the first step is to ensure 
that only a single mode can oscillate. Obtaining a single transverse mode is relatively easy: 
the gain region may be narrowed using the techniques discussed in section 5.10.2.3. The 
gain curve is relatively wide, however, and several longitudinal modes are usually present. 
Even if we ensure that only one of these can oscillate, there are problems with temperature 
stability. These arise because the position of the gain profile depends on the bandgap, and 
this in turn is temperature dependent. Thus, a change in temperature could cause the posi- 
tion of the gain profile to alter sufficiently to cause the laser to 'hop' to another longitudi- 
nal mode more favourably placed with regard to the gain profile. 

One way to improve this situation is to provide some wavelength-sensitive feedback that
is relatively insensitive to temperature. A very effective way of implementing this is to use
a 'distributed feedback' structure (ref. 6.3). The way in which this operates may be
appreciated by reference to Fig. 6.4. This shows a laser diode structure with a region at either end
containing corrugations. These act as frequency-selective mirrors. To see this, we may
consider two beams emerging from the pumped region at a small angle to the axis and being
reflected from different parts of the corrugations as shown in Fig. 6.5. There will be
constructive interference between the two beams when $2D = m\lambda_0/n$, where $D$ is the 'wavelength'
of the corrugations, $\lambda_0$ the vacuum wavelength, $n$ the refractive index of the laser medium,
and $m$ an integer. The corrugated structure will only act as a mirror for a particular mode if
its wavelength satisfies the above equation. Thus, by choosing an appropriate value of $D$
we can ensure that only one particular longitudinal mode can oscillate. The formal treatment
is rather complex and the interested reader is referred to ref. 6.4.
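
The Bragg condition $2D = m\lambda_0/n$ is easily evaluated numerically. The following sketch assumes a vacuum wavelength of 1.55 µm and a refractive index of 3.4; these are illustrative values for a semiconductor laser, not figures from the text.

```python
def corrugation_period(vacuum_wavelength_m, refractive_index, order=1):
    """Corrugation period D (m) satisfying 2D = m * lambda_0 / n."""
    return order * vacuum_wavelength_m / (2.0 * refractive_index)


def reflected_wavelengths(period_m, refractive_index, max_order=3):
    """Vacuum wavelengths (m) reflected by a given period, for orders m = 1..max_order."""
    return [2.0 * period_m * refractive_index / m for m in range(1, max_order + 1)]


# Assumed illustrative values: lambda_0 = 1.55 um, n = 3.4.
period = corrugation_period(1.55e-6, 3.4)
print(f"first-order period D = {period * 1e9:.0f} nm")
print([f"{w * 1e6:.3f} um" for w in reflected_wavelengths(period, 3.4)])
```

The first-order period is a fraction of a micrometre, which indicates how fine the corrugations must be.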

Although the structure of Fig. 6.4, which is often referred to as a distributed Bragg reflector
(DBR), undoubtedly works, it has the disadvantage that light enters the unpumped end
regions, where it may suffer a considerable amount of absorption. For this reason a more
efficient structure is given by extending the corrugations throughout the pumped region, thus
giving rise to the distributed feedback (DFB) laser shown in Fig. 6.6. Here, it will be noticed
that the corrugated region is in a layer adjacent to the active layer, there being sufficient
coupling between the two for the mirror action of the corrugations to be effective. The
corrugations are placed in an adjacent layer because it is difficult to form them directly on the
active layer without introducing defects into the active region that degrade the performance
of the laser.

FIG. 6.4 Schematic diagram of a heterostructure laser incorporating regions at either end which have
corrugations etched along one side. Such regions act as frequency-selective mirrors.

FIG. 6.5 Two beams of light being reflected from corresponding adjacent regions of the corrugation
shown in Fig. 6.4. The two reflected beams will remain in phase provided that the path differences are
multiples of the wavelength. Assuming that the beams make small angles with the laser axis, this path
difference is very nearly equal to $2D$.

An alternative approach is to pass the light through an additional, external coupled cavity.
In this way the only modes which can interfere constructively and oscillate are those which
resonate in both cavities. This results in a much greater mode frequency spacing so that only
one mode lies within the gain curve of the active medium.
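
The increase in mode spacing can be illustrated with a simple counting argument. In the sketch below each cavity is given an integer mode spacing in GHz, frequencies are measured from a point where the two combs coincide, and only frequencies present in both combs survive. The spacings of 150 GHz and 170 GHz are hypothetical, chosen only to make the arithmetic clear.

```python
from math import gcd


def common_mode_spacing(spacing_a_ghz, spacing_b_ghz):
    """Spacing (GHz) of frequencies that are modes of both cavities (least common multiple)."""
    return spacing_a_ghz * spacing_b_ghz // gcd(spacing_a_ghz, spacing_b_ghz)


def common_modes(spacing_a_ghz, spacing_b_ghz, max_ghz):
    """Frequencies (GHz, relative to one coincidence) that resonate in both cavities."""
    modes_a = set(range(0, max_ghz + 1, spacing_a_ghz))
    modes_b = set(range(0, max_ghz + 1, spacing_b_ghz))
    return sorted(modes_a & modes_b)


# Hypothetical mode spacings of the two coupled cavities, in GHz.
print(common_mode_spacing(150, 170))
print(common_modes(150, 170, 6000))
```

With these numbers the coincidences are 2550 GHz apart, many times the spacing of either cavity alone.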

Several designs have been used to achieve this including an external mirror (ref. 6.5a) and
the so-called cleaved coupled cavity ($\mathrm{C}^3$) arrangement (ref. 6.5b). In the latter case two laser
cavities are formed by cleaving from the same original chip so that the end facets (mirrors)
are perfectly parallel to each other. The two cavities are then mounted parallel to each other
on a heat sink. The current to each can be controlled separately. In operation the current flow
to one of the cavities is above threshold so that it acts as the laser, while that to the other is
kept below threshold, but its magnitude can be used to alter the carrier density and hence
the refractive index of the cavity. In this way the operating frequency can be tuned over a
fairly wide wavelength range.

FIG. 6.6 Distributed feedback laser structure.



6.3 Mode locking

Mode locking is a technique for producing periodic, high power, short duration laser pulses.
As we saw in the previous section, a typical inhomogeneously broadened laser can
support oscillations in many modes simultaneously. The output of such a laser as a function
of time depends on the relative phases, frequencies and amplitudes of the modes. The
total electric field as a function of time can be written as

$$\mathcal{E}(t) = \sum_{n=0}^{N-1} (\mathcal{E}_0)_n \exp[\mathrm{i}(\omega_n t + \delta_n)] \tag{6.2}$$

where $(\mathcal{E}_0)_n$, $\omega_n$ and $\delta_n$ are the amplitude, angular frequency and phase of the $n$th mode.
Normally the phases are randomly varying, so that the modes are incoherent and the total
irradiance is simply the sum of the irradiances of the individual modes, as we saw in section
1.2.2. Hence, for this situation, which is illustrated in Fig. 6.7(a), $I = N\mathcal{E}_0^2$, where we have
assumed for simplicity that all $N$ modes have the same amplitude $\mathcal{E}_0$. The irradiance may
exhibit small fluctuations if two or three of the modes happen to be in phase at any given time.

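Suppose that we now force the various modes to maintain the same relative phase $\delta$ to
one another, that is we mode lock the laser such that $\delta_n = \delta$. The total irradiance must then
be found by adding the individual electric fields rather than the irradiances. Using eq. (6.2),
the resultant electric field can now be written as

$$\mathcal{E}(t) = \mathcal{E}_0 \exp(\mathrm{i}\delta) \sum_{n=0}^{N-1} \exp(\mathrm{i}\omega_n t) \tag{6.3}$$

For convenience let us write the angular frequency $\omega_n$ as $\omega_n = \omega - n\,\delta\omega$, where $\omega$ is the
angular frequency of the highest frequency mode and $\delta\omega$ is the angular frequency separation
between modes, which from the mode spacing $\delta\nu = c/2L$ (section 5.9) is $\delta\omega = \pi c/L$.
Equation (6.3) for $\mathcal{E}(t)$ can then be rewritten as

$$\mathcal{E}(t) = \mathcal{E}_0 \exp(\mathrm{i}\delta) \sum_{n=0}^{N-1} \exp[\mathrm{i}(\omega - n\,\delta\omega)t] = \mathcal{E}_0 \exp[\mathrm{i}(\omega t + \delta)] \sum_{n=0}^{N-1} \exp(-\mathrm{i}\pi n c t/L)$$

or

$$\mathcal{E}(t) = \mathcal{E}_0 \exp[\mathrm{i}(\omega t + \delta)]\,\{1 + \exp(-\mathrm{i}\phi) + \exp(-2\mathrm{i}\phi) + \dots + \exp[-(N-1)\mathrm{i}\phi]\} \tag{6.4}$$

where $\phi = \pi c t/L$. The term in braces in eq. (6.4) is a geometric progression, which can be
summed in closed form.

FIG. 6.7 Field and irradiance as functions of time for (a) a non-mode-locked and (b) a mode-locked laser.
In (a) the phases of the modes are random at any instant and the total irradiance is the sum of the mode
irradiances, while in (b) the phase relationships of the modes are fixed and the output consists of a
series of narrow intense pulses of time spacing $2L/c$ and width $(1/N)(2L/c)$.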



Summing the geometric progression in eq. (6.4), with $\phi = \pi c t/L$, gives, apart from a phase
factor of unit modulus,

$$\mathcal{E}(t) = \mathcal{E}_0 \exp[\mathrm{i}(\omega t + \delta)]\,\frac{\sin(N\phi/2)}{\sin(\phi/2)}$$

The irradiance is then $I(t) = \mathcal{E}(t)\mathcal{E}^*(t)$ or

$$I(t) = \mathcal{E}_0^2\,\frac{\sin^2(N\phi/2)}{\sin^2(\phi/2)} \tag{6.5}$$

This equation is illustrated in Fig. 6.7(b). We see that the irradiance $I(t)$ is periodic
($\Delta\phi = 2\pi$) in the time interval $t = 2L/c$, which is the round trip transit time for light within
the cavity. The maximum value of the irradiance is $N^2\mathcal{E}_0^2$; it occurs for values of $\phi = 0$ or
$2p\pi$, $p$ being an integer, where the function $\sin^2(N\phi/2)/\sin^2(\phi/2)$ equals $N^2$.

The irradiance has its minimum value of zero when $N\phi/2 = p\pi$, $p$ being an integer which
is neither zero nor a multiple of $N$, that is when $\phi = 2p\pi/N$ or $t = (1/N)(2L/c)p$. Thus the time
duration of the pulses, that is the time for the irradiance to fall from its maximum value to
zero ($\Delta\phi = 2\pi/N$), is $\Delta t = (1/N)(2L/c)$. We can see, therefore, that the output of a mode-locked
laser consists of a sequence of short pulses, separated in time by $2L/c$, each of peak power
equal to $N$ times the average power (or $N$ times the power of the same laser with the modes
uncoupled). The ratio of the pulse spacing to the pulse width is approximately equal to the
number of modes $N$. Thus, to obtain high power, short duration pulses there should be a large
number of modes; this requires a broad laser transition and a long laser cavity.
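
The closed form $I = \mathcal{E}_0^2 \sin^2(N\phi/2)/\sin^2(\phi/2)$ can be checked against a direct sum over the $N$ phase-locked modes. The sketch below takes $\mathcal{E}_0 = 1$ and an arbitrary illustrative value $N = 8$; the function names are introduced here for the illustration only.

```python
import cmath
import math


def irradiance_sum(n_modes, phi):
    """Irradiance from directly adding N equal-amplitude, phase-locked mode fields."""
    field = sum(cmath.exp(-1j * n * phi) for n in range(n_modes))
    return abs(field) ** 2


def irradiance_closed_form(n_modes, phi):
    """sin^2(N phi / 2) / sin^2(phi / 2), with the limiting value N^2 at the peaks."""
    s = math.sin(phi / 2)
    if abs(s) < 1e-12:
        return float(n_modes ** 2)
    return (math.sin(n_modes * phi / 2) / s) ** 2


N = 8  # arbitrary illustrative number of modes
for phi in (0.0, 0.3, 2 * math.pi / N, math.pi):
    print(f"phi = {phi:.3f}: sum = {irradiance_sum(N, phi):.4f}, "
          f"closed form = {irradiance_closed_form(N, phi):.4f}")
```

The peak value is $N^2$ and the first zero falls at $\phi = 2\pi/N$, in line with the pulse width $(1/N)(2L/c)$.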



EXAMPLE 6.1 Mode-locked pulses

We may calculate the pulse separation and pulse duration in a mode-locked laser from the
fluorescent linewidth of the laser transition and the length $L$ of the laser rod, taking its
refractive index to be $n$. The mode separation is $c/2nL$; thus the number of modes oscillating,
$N$, is the linewidth divided by the mode separation. The pulse separation is $2nL/c$ and the
pulse duration is $(1/N)(2nL/c)$.
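
A calculation of this kind can be sketched as follows. The input numbers (a linewidth of $1.0 \times 10^{11}$ Hz, a rod 0.1 m long that fills the cavity, and a refractive index of 1.8) are assumptions chosen for illustration and should not be read as the values used in the example above.

```python
C = 2.998e8  # speed of light in vacuum, m/s


def mode_locked_pulse_train(linewidth_hz, rod_length_m, refractive_index):
    """Return (mode spacing, number of modes, pulse separation, pulse duration).

    Assumes the rod fills the cavity, so the mode spacing is c / (2 n L).
    """
    spacing_hz = C / (2.0 * refractive_index * rod_length_m)
    n_modes = int(linewidth_hz // spacing_hz)   # whole modes within the linewidth
    separation_s = 1.0 / spacing_hz             # round trip time 2 n L / c
    duration_s = separation_s / n_modes         # (1 / N) * (2 n L / c)
    return spacing_hz, n_modes, separation_s, duration_s


# Assumed illustrative inputs, not values from the text.
spacing, n_modes, separation, duration = mode_locked_pulse_train(1.0e11, 0.1, 1.8)
print(f"mode spacing {spacing / 1e6:.0f} MHz, {n_modes} modes")
print(f"pulse separation {separation * 1e9:.2f} ns, duration {duration * 1e12:.1f} ps")
```

The same mode spacing, $c/2nL$, is the frequency at which a modulator would have to be driven to mode lock such a laser actively.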

The situation can be pictured by considering a short wave packet bouncing back and forth
within the cavity, a pulse being emitted by the laser each time the packet strikes the output
mirror, as indicated in Fig. 6.8. This picture is particularly useful when describing the active
mode-locking mechanism.



6.3.1 Active mode locking

Mode locking is achieved by forcing the longitudinal modes to maintain fixed phase
relationships. This can be accomplished by modulating the loss (or gain) of the laser cavity at a
frequency equal to the intermode frequency separation $\delta\nu = c/2L$ (or $\delta\omega = \pi c/L$). Imagine
that the loss modulation is provided by a shutter placed near one of the mirrors. The shutter
is closed (corresponding to very high losses) most of the time and is only opened very briefly
every $2L/c$ seconds (corresponding to the round trip transit time of the wave packet
mentioned above). Now if the wave packet is exactly as long in time as the shutter stays open,
and if it arrives exactly when the shutter is open, it will be unaffected by the presence of the
shutter. Any parts of the wave packet, however, which arrive before the shutter opens or
after the shutter closes will be eliminated. Thus, the phase relationships of the oscillating
modes are continuously restored by the periodic operation of the shutter.

The electro-optic or acousto-optic modulators discussed in Chapter 3 can be used as shutters
to produce mode-locked pulses from an Nd:YAG laser, for example. In Nd:glass lasers, on the
other hand, which have a very broad laser transition line, the pulses can be of shorter duration
(see Problem 6.7).

FIG. 6.8 Packet of energy resulting from the locking of $N$ modes bouncing to and fro between the
mirrors. Each time the packet is incident on the output mirror, a 'mode-locked' pulse is emitted.



6.3.2 Passive mode locking

Mode locking can also be accomplished by using certain dyes whose absorption decreases
with increasing irradiance as shown in Fig. 6.9. Materials exhibiting this behaviour are called
saturable absorbers. A dye is chosen which has an absorption band at the laser transition
frequency. Initially, at low light levels, the dye is opaque owing to the large number of
unexcited molecules which can absorb the light. As the irradiance increases, more and
more of the excited states are populated until eventually all of them are filled and the dye
becomes transparent. The dye is now said to be bleached.

The growth of the mode-locked pulses can be envisaged as follows. Initially, the laser
medium emits spontaneous radiation which gives rise to incoherent fluctuations in the energy
density within the cavity. Some of these fluctuations, which can be of short duration, may
be amplified by the laser medium and grow in irradiance to such an extent that the peak
part of the fluctuation is transmitted by the saturable absorber with little attenuation. The
low power parts of the fluctuation, however, are much more strongly attenuated, and a high
power pulse can grow within the cavity providing the dye can recover in a time short
compared with the duration of the pulse. Because of the non-linear behaviour of the dye,
the shortest and most intense fluctuations grow at the expense of the weaker ones. With
careful adjustment of the concentration of the dye within the cavity, an initial fluctuation
may grow into a narrow pulse 'bouncing' to and fro within the cavity producing a periodic
train of mode-locked pulses.

FIG. 6.9 Absorption as a function of incident light irradiance for a saturable absorber.

Saturable absorbers provide a simple, inexpensive and rugged method of mode locking high
power lasers such as Nd:glass and ruby; the so-called 9740 or 9860 dye solutions and
cryptocyanine may be used as the saturable absorber for Nd:glass and ruby respectively. When
a saturable absorber is used to mode-lock a laser, the laser is simultaneously Q-switched
(section 6.4). The result is the production of a series of narrow (~10 ps) mode-locked pulses
contained within an envelope which may be several hundred nanoseconds long. The peak
power within the individual pulses may be enormous because of their very short duration.
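
The way a saturable absorber favours the strongest fluctuations can be sketched with a toy model. The absorber is described here by the common two-level approximation in which the absorption coefficient falls as $\alpha_0/(1 + I/I_s)$; this model, the saturation irradiance, the small-signal absorbance and the fixed round trip gain are all assumptions made for illustration and are not taken from the text. Gain saturation is ignored, so the model is only meaningful for a few passes.

```python
import math


def transmission(irradiance, i_sat=1.0, alpha0_d=2.0):
    """Single-pass transmission of a thin saturable absorber (assumed two-level model)."""
    return math.exp(-alpha0_d / (1.0 + irradiance / i_sat))


def after_round_trips(irradiance, gain=3.0, passes=5):
    """Irradiance of a fluctuation after several passes of fixed gain plus absorber loss."""
    for _ in range(passes):
        irradiance = gain * transmission(irradiance) * irradiance
    return irradiance


# Two fluctuations, in units of the saturation irradiance (hypothetical values).
weak, strong = 0.5, 1.0
print(f"transmission at low and high irradiance: "
      f"{transmission(0.1):.3f}, {transmission(10.0):.3f}")
print(f"weak fluctuation:   {weak} -> {after_round_trips(weak):.3f}")
print(f"strong fluctuation: {strong} -> {after_round_trips(strong):.3f}")
```

In this model the weaker fluctuation sees a net loss on every pass and dies away, while the stronger one partially bleaches the absorber and grows.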



6.4 Q-switching

Q-switching is another technique for obtaining short, intense bursts of oscillation from lasers.
Single high power pulses can be obtained by introducing time- or irradiance-dependent losses
into the cavity. The effects of such losses can be interpreted in terms of the 'spiking'
oscillations discussed in section 5.10.1. If there is initially a very high loss in the laser cavity,
the gain due to population inversion can reach a very high value without laser oscillations
occurring. The high loss prevents laser action while energy is being pumped into the excited
state of the medium. If, when a large population inversion has been achieved, the cavity loss
is suddenly reduced (i.e. the cavity Q is switched to a high value), laser oscillations will
suddenly commence. On Q-switching, the threshold gain decreases immediately (to the normal
value associated with a cavity of high Q) while the actual gain remains high because of the
large population inversion. Owing to the large difference between the actual and the
threshold gain, laser oscillations within the cavity build up very rapidly and all of the available
energy is emitted in a single large pulse. This quickly depopulates the upper lasing level to
such an extent that the gain is reduced below threshold and the lasing action stops. The time
variation of some of the laser parameters during Q-switching is shown schematically in Fig.
6.10. Q-switching dramatically increases the peak power obtainable from lasers.

In the ordinary pulsed mode, the output of an insulating crystal laser such as the Nd:YAG
consists of many random 'spikes' of about 1 µs duration with a separation of about 1 µs;
the length of the train of spikes depends principally on the duration of the exciting
flashtube source, which may be about 1 ms. Peak powers within the 'spikes' are typically of the
order of kilowatts. When the laser is Q-switched, however, the result is a single 'spike' of
great power, typically in the megawatt range, with a duration of 10-100 ns. It should be noted
that, although there is a vast increase in the peak power of a Q-switched laser, the total energy
emitted is less than in non-Q-switched operation owing to losses associated with the
Q-switching mechanism.

Q-switching is carried out by placing a closed shutter (i.e. the Q-switch) within the cavity,
thereby effectively isolating the cavity from the laser medium. After the laser has been
pumped, the shutter is opened so restoring the Q of the cavity. A little thought reveals that
there are two important requirements for effective Q-switching. These are:

1 The rate of pumping must be faster than the spontaneous decay rate of the upper lasing
level, otherwise the upper level will empty more quickly than it can be filled so that a
sufficiently large population inversion will not be achieved.

2 The Q-switch must switch rapidly in comparison with the build-up of laser oscillations,
otherwise the latter will build up gradually and a longer pulse will be obtained so
reducing the peak power. In practice, the Q-switch should operate in a time less than 1 ns.

FIG. 6.10 Schematic representation of the variation of the parameters - (a) flash lamp output, (b) cavity
Q, (c) population inversion and (d) output power - as a function of time during the formation of a
Q-switched laser pulse.



6.4.1 Methods of Q-switching

6.4.1.1 Rotating mirror method


This method, which was the first to be developed, involves rotating one of the mirrors at a
very high angular velocity so that the optical losses are large except for the brief interval in
each rotation cycle when the mirrors are very nearly parallel. Just before this point is reached
a trigger mechanism initiates the flash lamp discharge to pump the laser (here assumed to
be of the insulating crystal type). As the mirrors are not yet parallel, population inversion
can build up without laser action starting. When the mirrors become parallel, Q-switching
occurs allowing the Q-switched pulse to develop as illustrated in Fig. 6.10.

It should be pointed out that the repetition rate of laser firing is determined by control of
the flash lamp and not by the speed of rotation of the mirror, which may be as high as
60 000 rev min$^{-1}$. If the laser fired every revolution, the repetition rate would be about 1000
times per second, a rate which is prohibitive in insulating crystals owing to the excessive
heating of the laser rod which would occur. Although rotating-mirror-type Q-switches are
cheap, reliable and rugged, the method suffers from the major disadvantage of being slow.
This results in an inefficient production of Q-switched pulses with lower peak power than
can be produced by other methods.

EXAMPLE 6.2 Energy of Q-switched pulses

We may estimate the energy of the output pulses from a Q-switched laser as follows. We
assume that the population inversion is $N_i$ before the cavity is switched and that it falls to
$N_f$ at the end of the pulse (as shown in Fig. 6.10c).

The total energy emitted in the pulse is thus $E = \tfrac{1}{2}h\nu_{21}(N_i - N_f)V$, where $V$ is the volume
of the laser medium. The factor $\tfrac{1}{2}$ appears because the population inversion changes by two units
each time a photon is emitted (i.e. the population of the upper level decreases by one while
that of the lower level increases by one).

In a typical laser, $N_i$ may be approximately $10^{24}\ \mathrm{m^{-3}}$, and assuming $N_f \ll N_i$ and that the
laser frequency is $5 \times 10^{14}$ Hz and its volume is $10^{-5}\ \mathrm{m^3}$ we have that the energy of the pulses
is

$$E = \tfrac{1}{2} \times (6.63 \times 10^{-34}) \times (5 \times 10^{14}) \times 10^{24} \times 10^{-5} = 1.7\ \mathrm{J}$$



It can be shown (ref. 6.6) that the peak power, that is the greatest rate of change of
population inversion and hence of photon emission, occurs when the population inversion drops
to the threshold inversion $N_{th}$. To estimate the average power in the pulse, we need to
evaluate the pulse duration. We imagine that the Q-switched pulse oscillates to and fro between
the laser mirrors and that each time it strikes one of the mirrors a fraction $(1 - R)$ of its energy
is lost by transmission. The pulse will then make $1/(1 - R)$ passes along the length of the
cavity, which it accomplishes in a time $[1/(1 - R)](L/c)$. This is often referred to as the cavity
lifetime $t_c$ and may be taken as the duration of the pulse. The power of the pulse is then
approximately $P = E/t_c$, which, as $E = \tfrac{1}{2}h\nu_{21}(N_i - N_f)V$ (see Example 6.2), we can write as

$$P = \frac{h\nu_{21}(N_i - N_f)V(1 - R)c}{2L} \tag{6.6}$$



EXAMPLE 6.3 Power in Q-switched pulses

Using the data given in Example 6.2 we can estimate the power in the Q-switched pulses
from a laser with a cavity length of 0.1 m and a mirror reflectance of 0.8.

The cavity lifetime is $t_c = L/[(1 - R)c] = 1.7$ ns. Then the pulse power is given by
$E/t_c = 1.7/(1.7 \times 10^{-9}) = 10^{9}$ W.

In practice, owing to losses associated with the Q-switch, the actual power in the pulse
would probably be nearer $10^{8}$ W.
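
The arithmetic of Examples 6.2 and 6.3 can be reproduced with a few lines of code. The sketch below uses the values quoted in the two examples; the function names are introduced here purely for illustration.

```python
H = 6.63e-34  # Planck constant, J s
C = 3.0e8     # speed of light, m/s


def pulse_energy(frequency_hz, n_initial, n_final, volume_m3):
    """E = (1/2) h nu (N_i - N_f) V, in joules."""
    return 0.5 * H * frequency_hz * (n_initial - n_final) * volume_m3


def cavity_lifetime(cavity_length_m, reflectance):
    """t_c = L / ((1 - R) c), in seconds."""
    return cavity_length_m / ((1.0 - reflectance) * C)


# Values from Examples 6.2 and 6.3, with N_f taken as negligible compared with N_i.
energy = pulse_energy(5e14, 1e24, 0.0, 1e-5)
t_c = cavity_lifetime(0.1, 0.8)
power = energy / t_c

print(f"E = {energy:.2f} J, t_c = {t_c * 1e9:.2f} ns, P = {power:.2e} W")
```

This gives an ideal pulse power of about $10^{9}$ W before any allowance for losses in the Q-switch.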

6.4.1.2 Electro-optic Q-switching

The electro-optic, magneto-optic and acousto-optic modulators described in Chapter 3 can
be used as fast Q-switches. If a Pockels cell, for example, is used and the laser output is not
naturally polarized, then a polarizer must be placed in the cavity along with the electro-optic
cell as shown in Fig. 6.11.

A voltage is applied to the cell to produce a quarter-wave plate which converts the linearly
polarized light incident on it into circularly polarized light. The laser mirror reflects
this light and in so doing reverses its direction of rotation so that on repassing through the
electro-optic cell it emerges as plane polarized light, but at 90° to its original direction of
polarization. This light is therefore not transmitted by the polarizer and the cavity is 'switched
off'. When the voltage is reduced to zero, there is no rotation of the plane of polarization
and Q-switching occurs. The change of voltage, which is synchronized with the pumping
mechanism, can be accomplished in less than 10 ns and very effective Q-switching occurs.

Alternative arrangements using Kerr cells and acousto-optic modulators are available. In
the case of the acousto-optic modulator, application of an acoustical signal to the modulator
deflects some of the beam out of the cavity (see Fig. 3.21), thereby creating a high loss.
When the sound wave is shut off, Q-switching occurs as before. Acousto-optic devices are
often used when the laser medium is continuously pumped and repetitively Q-switched, as
is frequently the case with Nd:YAG and $\mathrm{CO_2}$ lasers.

6.4.1.3 Passive Q-switching

Passive Q-switching may be accomplished by placing a saturable absorber (bleachable dye)
of the type mentioned in section 6.3.2 in the cavity. At the beginning of the excitation flash
the dye is opaque, thereby preventing laser action and allowing a larger population inversion
to be achieved than would otherwise be the case. As the light irradiance within the cavity
increases, the dye can no longer absorb, that is it bleaches and Q-switching occurs. Passive
Q-switching has the great advantage of being extremely simple to implement, involving
nothing more than a dye in a suitable solvent held in a transparent cell. Suitable dyes include
cryptocyanine for ruby lasers and sulfur hexafluoride for $\mathrm{CO_2}$ lasers.

As we mentioned in section 6.3.2, lasers that use a saturable absorber for Q-switching are
also mode locked if the dye, once bleached, recovers in a time short compared with the
duration of the mode-locked pulses.



6.5 Laser applications

In the time which has elapsed since Maiman first demonstrated laser action in ruby in 1960, 
the applications of lasers have multiplied to such an extent that almost all aspects of our daily 
lives are touched upon, albeit indirectly, by lasers. They are used in many types of indus- 
trial processing, engineering, metrology, scientific research, communications, holography, 
medicine, and for military purposes. It is clearly impossible to give an exhaustive survey of 
all of these applications and the reader is referred to the selection of texts and journals given 
in ref. 6.7. Rather than attempt the impossible, we discuss the properties of laser radiation 
which make it so useful and reinforce this discussion by brief mention of some appropriate 
applications. In addition, a rather more detailed description of one or two selected applica- 
tions is given in sections 6.6 and 6.7. 

6.5.1 Properties of laser light 

In considering the various properties of laser light we must always remember that not all of 
the different types of laser exhibit these properties to the same degree. This may often limit 
the choice of laser for a given application. 

6.5.1.1 Directionality 

Perhaps the most arresting property of laser light is its directionality. Apart from semicon- 
ductor junction lasers, lasers emit radiation in a highly directional, collimated beam with a 
low angle of divergence. This is important because it means that the energy carried by the 
laser beam can be collected easily and focused onto a small area. For conventional sources,
where the radiation spreads out into a solid angle of 4π sr, efficient collection is almost
impossible, while for lasers the beam divergence angle is so small that efficient collection is
possible even at large distances from the laser.

The extent of beam divergence is set by diffraction (section 1.2.4). This is a fundamental
physical phenomenon, rather than an engineering limit that can be improved by better
optical design. The angle of divergence in radians at the diffraction limit is given by θ,
where

$$ \theta = K\lambda/D \qquad (6.7) $$

D is the diameter of the aperture through which the beam emerges, and K is a numerical
factor of the order of unity. The precise value of K depends on the nature of the beam. For
example, a TEM_00 beam has a Gaussian profile (section 5.9.2), and Fig. 6.12 illustrates how
the beam diverges outside the laser cavity. We see that the beam divergence angle θ tends
asymptotically to the value sin^-1(w/z). Since from eq. (5.37) we also have that w = zλ/πw_0,
it follows that at relatively large distances from the laser cavity θ = sin^-1(λ/πw_0). Now
λ/πw_0 is usually much less than unity, so that we finally obtain θ ≈ λ/πw_0. Assuming that we
may associate the minimum beam diameter 2w_0 with an aperture diameter D, we see that this
result agrees with the general divergence equation (6.7) with the parameter K taking the value
2/π. The beam divergence tends to increase with increasing power output and mode content.
Table 6.1 gives some typical beam divergence angles.

The beam may be further collimated by passing it in the reverse direction through a telescope
as illustrated in Fig. 6.13. The beam is enlarged by the factor f_2/f_1, and hence the
divergence, which is inversely proportional to the beam diameter, is decreased by the factor
f_1/f_2. The ratio of the beam diameters before and after the collimator is given by

$$ D_1/D_2 = f_1/f_2 $$




FIG. 6.12 Divergence of a laser beam outside the laser cavity.

TABLE 6.1 Typical laser beam divergence angles

FIG. 6.13 Schematic diagram showing the collimation of a laser beam using a telescope having an
eyepiece lens of diameter D_1 and focal length f_1 and an objective lens of diameter D_2 and
focal length f_2. The beam width is enlarged by the factor D_2/D_1 = f_2/f_1 and the divergence
angle is decreased by the factor f_1/f_2.

The directional nature, in particular, of gas laser beams readily lends them to applications 
involving accurate alignment, including civil engineering projects such as drainage and tunnel 
boring, surveying and the assembly of large aircraft and ships, etc. (for additional references 
see ref. 6.7e, pp. 245-6).



EXAMPLE 6.4 Beam collimation 



We may calculate the reduction in the divergence of a laser beam which is collimated 
by passing it through a telescope with an objective lens to eyepiece lens focal length 
ratio of 30: 1 . Let us consider an He-Ne laser with a plasma tube diameter of 3 mm. 

The divergence θ of the beam from the laser is given approximately by λ/D.
Therefore θ_1 ≈ 633 × 10^-9/(3 × 10^-3) ≈ 2.1 × 10^-4 rad (or 0.7 minutes of arc).

Hence, after collimation the angle of divergence will be reduced by a factor of 30,
to θ_2 = 7 × 10^-6 rad (or 1.4 seconds of arc).
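The arithmetic of this example can be checked with a short Python sketch. It assumes the diffraction-limited form θ = Kλ/D of eq. (6.7) with K taken as 1, as the example does; the function names are our own and purely illustrative.

```python
import math

def diffraction_divergence(wavelength_m, aperture_m, k=1.0):
    """Divergence angle in radians from eq. (6.7), theta = K * lambda / D."""
    return k * wavelength_m / aperture_m

def collimated_divergence(theta_rad, f_objective, f_eyepiece):
    """Divergence after a telescope used in reverse: reduced by f_eyepiece / f_objective."""
    return theta_rad * f_eyepiece / f_objective

ARCMIN_PER_RAD = 60.0 * 180.0 / math.pi   # minutes of arc in one radian

# He-Ne laser (633 nm) with a 3 mm plasma tube and a 30:1 telescope
theta_1 = diffraction_divergence(633e-9, 3e-3)
theta_2 = collimated_divergence(theta_1, f_objective=30.0, f_eyepiece=1.0)

print(f"before: {theta_1:.2e} rad = {theta_1 * ARCMIN_PER_RAD:.2f} arcmin")
print(f"after:  {theta_2:.2e} rad = {theta_2 * ARCMIN_PER_RAD * 60:.2f} arcsec")
```

Only the ratio of the two focal lengths matters here, so any pair of values in the ratio 30:1 gives the same result.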



6.5.1.2 Linewidth 

Laser light is potentially extremely monochromatic but, as we saw in Chapter 5, the spec- 
tral content of the laser radiation may extend over almost as wide a range as the fluorescent 
linewidth of the laser medium. In other words, although the linewidth of an individual cavity 
mode may be extremely small there may be many modes present in the laser output. We 
saw in section 6.1 how single mode operation and frequency stabilization can be achieved.
The high spectral purity of laser radiation leads directly to applications in basic scientific 
research including photochemistry, luminescence excitation spectroscopy, absorption and 
Raman spectroscopy and also in communications. Many other applications also depend, in 
part, on this property (ref. 6.7e). 



6.5.1.3 Beam coherence 

One of the characteristics of stimulated emission is that the stimulated wave is in phase with
the stimulating wave; that is, the spatial and temporal variation of the electric field of the
two waves are the same. Thus in a 'perfect' laser we would expect the electric field to vary
with time in an identical fashion for every point on the beam cross-section. Such a beam 
would have perfect spatial coherence. Another related property is temporal coherence, which 
refers to the relative phase relationship of the electric field at the same place as a function 
of time. If the phase changes uniformly with time, then the beam is said to show perfect tem- 
poral coherence. These ideas are illustrated in Fig. 6.14. 

Coherence is often specified in terms of the mutual coherence function γ_12(τ) (see ref. 6.8).
This quantity, which is in fact a complex number, is a measure of the correlation between
the light wave at two points P_1 and P_2 along the direction of propagation at different times
t and t + τ (temporal coherence), and at points P_1 and P_3 on a plane perpendicular to the
direction of propagation at the same time (spatial coherence) (see the representative points on
Fig. 6.14a). It has an absolute value between 0 and 1. When it has the value zero the light
is completely incoherent, while a value of unity implies complete coherence. Although these
extreme values are never achieved in practice, the light from a gas laser operating in a single
transverse mode has a value quite close to unity.

FIG. 6.14 Illustration of coherence. (a) A perfectly coherent beam in which all the constituent
waves are in phase at all times. (b) A beam which is spatially coherent but which exhibits only
partial temporal coherence. This is because the waves simultaneously change their phase by an
identical amount every few oscillations. (c) An almost completely incoherent beam where the
phases of each wave change randomly at random times. Note, however, that even in this case some
small degree of temporal coherence remains.

Two useful quantities that are related to temporal coherence are the coherence time and
the coherence length. To understand these we consider what happens when we take a beam,
split it into two equal parts, let the two components travel different distances and then
recombine them to form interference fringes as in the Michelson interferometer described in
section 6.6.1. Interference effects will only be observed if the path difference of the two
beams is such that they are still coherent when they recombine. Light beams from 'real life'
sources cannot be represented by infinitely long wave trains, so there is a limit to the path
difference. Disregarding lasers for a moment, let us consider a group of atoms undergoing
spontaneous emission. Each atom emits radiation independently of the others and does so for only
a finite time. This is often because the emission process is perturbed in some way, for example
during a collision with a neighbouring atom as described in section 5.7. Thus each atom
generates a finite-length wave train and for simplicity we regard the wave trains from all the
atoms as having the same length (L_c). Since, however, the atoms are emitting spontaneously,
the wave trains are not in phase with each other. If now we pass such a beam into the
Michelson interferometer, each individual wave train will be split into two, and the two split
wave trains will be able to interfere provided the path difference does not exceed L_c. We
refer to L_c as the coherence length. If the path difference does exceed L_c, then the two halves
of each wave train cannot overlap in time when they are recombined, and hence they cannot
interfere. In addition, when the path difference lies between zero and L_c, then only a part of
each wave train can take part in interference (Fig. 6.15). This implies that as we increase
the path difference from zero up to L_c, the irradiance fluctuations corresponding to
constructive and destructive interferences will gradually diminish and die out altogether at L_c.

In reality of course, the wave trains are not all the same length but are distributed in some 
way about a mean value. The conclusion that the irradiance fluctuations will gradually reduce, 
however, still remains valid. Similarly the exact definition of coherence length is somewhat 
more involved, but we may still take it to be the path difference at which the irradiance fluc- 
tuations die out. 

The coherence time t_c may then be defined as the time taken for the source to emit a wave
train of length L_c. Thus

$$ t_c = L_c/c \qquad (6.8) $$

where c is the velocity of light. In fact it can be shown (ref. 6.8) that the coherence time is
related to the linewidth of the emission (Δν) via the equation

$$ t_c = \frac{1}{\Delta\nu} \qquad (6.9) $$

Although we started this discussion of coherence by considering spontaneous emission, the
concepts of coherence length and coherence time apply equally well to laser radiation.
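Equations (6.8) and (6.9) combine to give L_c = c/Δν, and the following minimal Python sketch evaluates this for the linewidths quoted in Example 6.5. The function names and the dictionary of sources are our own illustrative choices, not part of the original treatment.

```python
C = 2.998e8  # velocity of light, m/s

def coherence_time(linewidth_hz):
    """Eq. (6.9): t_c = 1 / (linewidth)."""
    return 1.0 / linewidth_hz

def coherence_length(linewidth_hz):
    """Eq. (6.8) rearranged: L_c = c * t_c."""
    return C * coherence_time(linewidth_hz)

# Linewidths taken from Example 6.5
sources = {
    "sodium D lines (low pressure lamp)": 5.1e11,
    "He-Ne laser, multimode": 1500e6,
    "He-Ne laser, single mode stabilized to 1 MHz": 1e6,
}

for name, linewidth in sources.items():
    print(f"{name}: t_c = {coherence_time(linewidth):.2e} s, "
          f"L_c = {coherence_length(linewidth):.3g} m")
```

The inverse dependence on linewidth is the point of the comparison: narrowing the He-Ne linewidth by a factor of 1500 lengthens the coherence length by the same factor.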

EXAMPLE 6.5 Coherence lengths of conventional and laser radiation sources

We consider first of all the light emitted from a low pressure sodium lamp. A typical
linewidth of the sodium D lines (both lines taken together) at λ_0 = 589 nm is
5.1 × 10^11 Hz. Equation (6.9) then gives t_c ≈ 2 × 10^-12 s and from eq. (6.8), L_c ≈ 0.6 mm.
We may contrast these values with those applicable to an He-Ne laser. If many modes
are operating, then the linewidth is about 1500 MHz giving a coherence length of 0.2 m.
However, if the laser is operating in a single mode stabilized to 1 MHz, then the
coherence length is some 1500 times greater, that is 300 m.

Thus for there to be coherence at the points P_1 and P_2 in the direction of propagation, the
separation of P_1 and P_2 should be less than L_c.

Consider now a perfectly collimated beam of monochromatic light; then the phase is the
same at any point on a surface perpendicular to the beam, and the points P_1 and P_3 may be
separated by quite large distances. On the other hand, for a non-monochromatic, incompletely
collimated beam the points P_1 and P_3 can only be a fraction of a millimetre apart if there is
to be any coherence between them. We sometimes define a transverse coherence length, L_t,
which represents the distance perpendicular to the main propagation direction over which the
phases at two points remain correlated. If there is coherence between the points P_1 and P_3 then
interference effects should result if the light from these points can be brought together. In
principle this can be accomplished using the Young's slits apparatus described in section 1.2.3.

Example 6.5 illustrates how the presence of several modes can dramatically reduce the
temporal coherence of a laser, and the same is also true of the spatial coherence. A
well-stabilized laser emitting a single transverse mode exhibits almost perfect spatial coherence
across the entire beam cross-section. It has been possible to expand the TEM_00 mode of gas
lasers to give transverse coherence lengths of several metres for holographic purposes (see
section 6.7). On the other hand, if a number of transverse modes are present the spatial
coherence is considerably reduced.

While the coherence of the output from CW gas lasers can be very high, that from pulsed 
lasers is usually somewhat smaller. This is because the temporal coherence may be limited 
by the duration of the spikes within the laser pulse or by shifts in frequency during emission. 
Thus, ruby lasers emit spikes that are typically 0.6 μs in duration and exhibit coherence times
of about 0.1 μs. The coherence lengths of some common lasers are given in Table 6.2.

Coherence is important in any application where the laser beam will be split into parts. 
These include interferometric measurement of distance (section 6.6) and deformation, where 
the light is split into parts that traverse different distances, and holography, where the light 
beams traverse different paths which may be approximately equal but which may have spa- 
tially different distributions. 

One of the most striking characteristics of laser light reflected from rough surfaces is its 
speckled or granular appearance. This is the result of a random interference pattern formed 
from the contributions of light reflected from neighbouring portions of the surface. In some 
regions, these contributions will interfere constructively, while in others they interfere 
destructively. This behaviour is a direct consequence of the high coherence of laser light. 

In many applications, for example holography, the speckle pattern is a nuisance, though 
the phenomenon is finding application in a number of areas including metrology and vibra- 
tion analysis (ref. 6.9). 

6.5.1.4 Brightness

The primary characteristic of laser radiation is that lasers have a higher brightness than any 
other light source. We define brightness as the power emitted per unit area per unit solid 
angle (sometimes the term specific brightness, i.e. brightness per unit wavelength range,
W m^-2 sr^-1 Δλ^-1, is used). (In radiometry this quantity, brightness, is called the radiance,
see section 1.5, but in laser work the term brightness as defined here is used.) The relevant solid
angle is that defined by the cone into which the beam spreads. Hence, as lasers can produce 
high levels of power in well-collimated beams, they represent sources of great brightness. 



TABLE 6.2 Summary of coherence lengths of some common lasers

Laser                                               Typical coherence length

He-Ne single transverse, single longitudinal mode   Up to 1000 m
He-Ne multimode                                     0.1 to 0.2 m
Argon multimode                                     0.02 m
Nd:YAG                                              10^-2 m
Nd:glass                                            2 × 10^-4 m
GaAs                                                1 × 10^-3 m
Ruby:
  for whole output pulse                            10^-2 m
  within a spike-forming part of the pulse          ≤ c times spike length, i.e. ≈30 m

The brightness is also affected by the presence of additional modes, for often as the laser
power is increased the number of modes increases but the brightness remains almost
constant. Typical values of brightness are: for an He-Ne laser, 10 H 'Wm ; sr '; for a 
Q-switched ruby laser, 10^16 W m^-2 sr^-1; and for an Nd:glass laser followed by amplifiers,
10^21 W m^-2 sr^-1 has been achieved. For comparison, the brightness of the sun is about
l.3x K/'Wrn^sf 1 ! 

High brightness is essential for the delivery of high power per unit area to a target; this 
in turn depends on the size of the spot to which the beam can be focused. 



6.5.1.5 Focusing properties of laser radiation

The minimum spot size to which a laser beam can be focused is determined by diffraction.
We have seen in section 5.9 that all laser beams possess a degree of divergence, even though
in some cases this might be quite small. We can 'reverse' this divergence by inserting a
suitable focusing lens into the beam; the lens brings the laser beam to a focus at a distance
nearly equal to f, its focal length (section 1.3.2).

If the laser beam has a radius w_L at the lens then by analogy with eq. (5.37), we may
write

$$ w_L = \frac{\lambda f}{\pi r_s} $$

where r_s represents the focused beam radius, or 'spot size'. Hence

$$ r_s = \frac{\lambda f}{\pi w_L} \qquad (6.10) $$

If the laser beam exactly fills the lens aperture we can set 2w_L = D, where D is the diameter
of the lens, so that

$$ r_s = \frac{2\lambda f}{\pi D} = \frac{2}{\pi}\lambda F \qquad (6.10a) $$

where F (= f/D) is the F number of the lens. It is impracticable to work with F numbers much
smaller than unity so that r_s is of the order of λ. It is interesting to note from eq. (6.10)
that the larger the beam diameter at the focusing lens the smaller is the focused spot radius
r_s. For this reason the laser beam is often passed through a beam expander before being focused
onto the workpiece.

Once again the presence of a complicated mode structure in the beam is deleterious in
that, in this case, the focused spot size is much larger and the power density (irradiance) is
correspondingly much smaller for a given laser power. Again, if the beam divergence is large
the power density is reduced. Nevertheless, doped insulator lasers, because of the very high
peak powers they generate, can easily produce very high irradiances. A focal area of 10^-7 m^2
is typical for such lasers, giving rise to typical average irradiances of 10^9 W m^-2 and peak
irradiances of 10^12 W m^-2.

EXAMPLE 6.6 Focused power densities of laser radiation

We consider a 10 mW He-Ne laser focused by a lens with an F number of 1.
From eq. (6.10a) the radius of the focused spot is

$$ r_s = 632.8 \times 10^{-9} \times (2/\pi)\ \mathrm{m} $$

Hence the power per unit area is equal to

$$ \frac{10 \times 10^{-3} \times \pi}{(632.8 \times 10^{-9} \times 2)^2} \approx 2 \times 10^{10}\ \mathrm{W\,m^{-2}} $$
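The numbers in Example 6.6 can be reproduced with the short Python sketch below. It takes eq. (6.10a) in the form r_s = (2/π)λF and, as the example does, assumes that the power is spread uniformly over a disc of radius r_s; the function names are our own.

```python
import math

def focused_spot_radius(wavelength_m, f_number):
    """Eq. (6.10a): r_s = (2 / pi) * lambda * F for a beam that just fills the lens."""
    return 2.0 * wavelength_m * f_number / math.pi

def mean_irradiance(power_w, spot_radius_m):
    """Power per unit area, assuming uniform illumination of a disc of radius r_s."""
    return power_w / (math.pi * spot_radius_m ** 2)

# 10 mW He-Ne laser (632.8 nm) focused by an F/1 lens
r_s = focused_spot_radius(632.8e-9, f_number=1.0)
irradiance = mean_irradiance(10e-3, r_s)

print(f"spot radius r_s = {r_s:.3e} m")
print(f"irradiance      = {irradiance:.2e} W/m^2")
```

Because r_s scales with F, doubling the F number reduces the irradiance fourfold, which is why short focal length lenses are favoured when the highest power densities are needed.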



As we shall see in section 6.8 such high irradiances lead to the use of lasers in the drilling, 
cutting, welding and heat treatment of a large number of different materials (ref. 6.10). In 
certain applications, for example the micromachining of electronic components, good focus- 
ing is required and hence we would wish to use a short focal length lens. This, however, 
may be impracticable on a production line because of the limited depth of focus (or field). 
We must provide sufficient depth of focus to allow for vibrations and inaccuracy in posi- 
tioning in the vertical sense. The depth of focus is the distance that we can move the work- 
piece away from the position of minimum beam radius and still have an acceptably small 
spot of light. From eq. (5.34) we know that the beam radius varies with distance in a para- 
bolic manner. Rearranging this equation we have 

$$ \frac{\lambda z}{\pi w_0^2} = \left[\left(\frac{w(z)}{w_0}\right)^2 - 1\right]^{1/2} \qquad (6.11) $$

Thus if we know the maximum variation in spot size that is acceptable, that is the maximum
value of w(z)/w_0, then eq. (6.11) enables the maximum tolerable variation in z to be
calculated. We note that the smaller we make the minimum beam radius, the smaller is the depth
of focus, and we have to reach an acceptable compromise between sufficiently large depth
of focus and small focal area.
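The trade-off can be made concrete with a short Python sketch. It assumes eq. (6.11) solved for z, that is z = (πw_0²/λ)[(w(z)/w_0)² - 1]^(1/2), and uses the CO2 laser figures of Example 6.7; the function name is our own.

```python
import math

def depth_of_focus(w0_m, wavelength_m, tolerance_ratio):
    """Distance z from the focus at which the beam radius reaches
    tolerance_ratio * w0, from eq. (6.11) solved for z."""
    return (math.pi * w0_m ** 2 / wavelength_m) * math.sqrt(tolerance_ratio ** 2 - 1.0)

WAVELENGTH = 10.6e-6      # CO2 laser wavelength, m
FOCAL_LENGTH = 200e-3     # m
BEAM_RADIUS_AT_LENS = 25e-3   # m, beam fills a 50 mm aperture

# Spot radius from eq. (6.10); Example 6.7 rounds this value to 30 um
w0_exact = WAVELENGTH * FOCAL_LENGTH / (math.pi * BEAM_RADIUS_AT_LENS)

z_rounded = depth_of_focus(30e-6, WAVELENGTH, 1.1)    # using the rounded 30 um
z_exact = depth_of_focus(w0_exact, WAVELENGTH, 1.1)   # using the unrounded radius

print(f"w0 (unrounded)        = {w0_exact * 1e6:.1f} um")
print(f"depth of focus, 30 um = {z_rounded * 1e3:.3f} mm")
print(f"depth of focus, exact = {z_exact * 1e3:.3f} mm")
```

Since z varies as w_0², the depth of focus is sensitive to the spot radius: rounding the radius up from about 27 μm to 30 μm changes the result from roughly 0.10 mm to 0.12 mm.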



EXAMPLE 6.7 Depth of focus for a CO2 laser beam

We consider a CO2 laser beam (λ = 10.6 μm), which is passed through a beam
expander to fill the 50 mm diameter aperture of a focusing lens which has a 200 mm
focal length. Using eq. (6.10), we can calculate the radius of the focused spot, which
we here equate to w_0 in eq. (6.11), that is

$$ r_s = w_0 = \frac{10.6 \times 10^{-6} \times 200 \times 10^{-3}}{\pi \times 25 \times 10^{-3}} \approx 30\ \mu\mathrm{m} $$

We suppose that a 10% variation in the spot size can be tolerated, that is w(z)/w_0 = 1.1.
Hence the depth of focus is given by

$$ z = \frac{\pi (30 \times 10^{-6})^2}{10.6 \times 10^{-6}} \left[(1.1)^2 - 1\right]^{1/2} \approx 1.2 \times 10^{-4}\ \mathrm{m} $$

or 0.12 mm.

The selection of a laser for a given application involving laser 'heating' depends very
much on the nature of the application. For many cutting tasks it may be advantageous to
use a CW laser; for continuous output, the highest powers are produced by CO2 lasers for
which values up to 100 kW have been quoted. For welding operations, a pulsed laser may
be preferred. In this case, because of the very short pulses which can be produced, we find
that Q-switched Nd:glass lasers generating pulses with peak powers of about 10" W are
commercially available.

The focusing properties of laser radiation are also important in low power applications, 
two of which represent the first laser-based devices to be used by the public at large. The 
first of these is the 'point of sale' device used to price items in supermarkets and to provide 
for automatic stock information upgrading. Products have a coded label, consisting of a series 
of parallel bars of varying widths, placed on them. This is scanned by a laser. The light 
reflected from the bars is detected, thereby identifying and pricing the product and adding 
the price to the bill. 

The second application is in the preparation and readout of video-disk systems. The
information is imprinted on the video-disk in digital form by forming small pits in the surface
of the disk with a laser. These pits are subsequently read by a low power laser to provide a
video signal for playback on a television set. The system has a number of advantages over
tape or stylus pick-up systems including the absence of wear, the high information density
that can be accommodated by the closely focused laser beam and the fact that warped disks
can be played equally well. Also high quality 'frozen' images can be selected at will and
held indefinitely. Similar arrangements are used in compact disc (CD) audio systems, and
are becoming increasingly popular for archival and computer storage (ref. 6.11).

6.5.1.6 Tunability

We saw in Chapter 5 and the earlier part of this chapter that some lasers can be tuned to emit
radiation over a range of wavelengths. With dye lasers, for example, the range of tunability
can be large. Indeed dye lasers can be tuned over most of the visible spectrum, and by
harmonic generation this range can be extended into the ultraviolet. On the other hand, optical
parametric amplifiers used with a primary laser source can upconvert into the range 1-25 μm
in the infrared (section 3.9.1).

Laser tunability leads to applications in photochemistry, high resolution and Raman
spectroscopy and isotope separation (ref. 6.12).



6.6 Measurement of distance 

The main methods of measuring distance using lasers are: (a) interferometry; (b) beam
modulation telemetry; and (c) pulse time of flight (ref. 6.13a).

6.6.1 Interferometric methods

We saw in Chapter 1 that if the wavefront from a light source is divided into two parts which
traverse different distances before being recombined, then an interference fringe pattern
is produced. The irradiance distribution in the pattern is characteristic of the point-for-point
path differences between the two parts of the beam. Thus, if one of the path lengths is changed
the fringe pattern will move across the field of view and the change in path length can be
measured in terms of the fringe shift.

The classical method for measuring distance (or changes of distance) in this way is the 
Michelson interferometer and nearly all other methods are variations of this instrument. The 
Michelson interferometer, which is shown in Fig. 6.16, consists of a beam splitter, two plane 
mirrors and an observing telescope. The wavefront from the laser source is divided by the 
beam splitter B; the two parts then proceed to the plane mirrors M_1 and M_2 and are reflected
back to B. Some of the light is reflected by the beam splitter and some is transmitted as shown,
so that the beam splitter serves to recombine the beams and interference fringes can be seen
through the telescope. We may regard the fringes as being produced in the thin film formed
between the mirror M_1 and M_2', which is the reflection of mirror M_2 in B. Thus if M_1 and M_2'
are exactly parallel, that is M_1 and M_2 are exactly perpendicular to each other, a system of
circular fringes will be seen as explained in Chapter 1. On the other hand, if one of the mirrors
is tilted slightly, then a system of straight line fringes is formed.

We saw in Chapter 1 (eq. 1.23) that for thin film interference a bright fringe is formed
when

$$ p\lambda_0 = 2D\cos\theta \approx 2D \quad (\text{if } \theta \text{ is small}) $$

where D is the optical thickness of the film. Hence if one of the mirrors, M_2 say, is moved,
D will change and the fringe pattern will move. Specifically, if D changes by λ_0/2 a
complete fringe will pass a reference point in the field of view. Therefore we can measure the
distance moved by M_2 in terms of fringe shifts, that is in terms of the wavelength λ_0 of the
light used. To measure an unknown distance one simply aligns M_2 with one end and counts
the fringe shift as it is moved until it coincides with the other end of the distance being
measured.

FIG. 6.16 Michelson interferometer; M_1 is a fixed mirror, M_2 is movable and M_2' is its image
in the beam splitter B. Interference fringes can be seen through the telescope, or alternatively
the number of fringes crossing an aperture placed in position A can be detected using a
photodetector and counted electronically.

Optical interferometry predates the laser by many years, of course, but the technique was 
always limited by the coherence limitations of the light sources available. Distances of a few
centimetres could be measured at best. With an He-Ne laser, however, coherence lengths
of many metres are available so that, in principle, we can measure up to such distances with
an accuracy of a fraction of a wavelength. Fringe displacements of 0.01 of a fringe,
equivalent to λ/200, can be detected. We must remember, however, that the distances measured
are optical path lengths which include the refractive index of the air. Changes in refractive 
index, due to pressure and temperature variations and atmospheric turbulence, result in 
random fringe shifts and thereby limit the distance which can be measured and the accu- 
racy attainable. Accuracies of about 1 part in lO^ can be achieved quite readily. In practice 
the plane mirrors are replaced by cube corner retroreflectors (Fig. 6.17). These have the
property of reflecting an incident beam back along a direction parallel to its incident path,
thereby simplifying the alignment of the instrument. The lateral displacement involved prevents
the returning light from entering the laser cavity and thereby creating an undesirable
modulation of the laser output. The large number of fringes which cross the field of view is
counted electronically using, for example, a silicon photodiode as a detector.

The technique is widely used in machine tool control, length standard calibration and for 
seismic and geodetic purposes. 
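The conversion from a fringe count to a mirror displacement can be sketched in a few lines of Python. The sketch assumes that one complete fringe corresponds to a change of half a vacuum wavelength in the optical thickness, as stated above, and that the geometric displacement follows by dividing by the refractive index of the air; the function names and the default index of 1.0 are our own illustrative choices.

```python
def displacement_from_fringes(fringe_count, wavelength_m, n_air=1.0):
    """Mirror displacement for a given fringe shift: one fringe per
    half vacuum wavelength of optical path, divided by the index of the air."""
    return fringe_count * wavelength_m / (2.0 * n_air)

def fringes_for_displacement(displacement_m, wavelength_m, n_air=1.0):
    """Inverse relation: number of fringes that cross the reference point."""
    return 2.0 * n_air * displacement_m / wavelength_m

HE_NE = 632.8e-9  # He-Ne wavelength, m

# Smallest detectable shift quoted in the text: 0.01 of a fringe
resolution = displacement_from_fringes(0.01, HE_NE)
# Fringes counted while the mirror travels 1 mm
count_1mm = fringes_for_displacement(1e-3, HE_NE)

print(f"resolution = {resolution * 1e9:.2f} nm")
print(f"fringes per millimetre of travel = {count_1mm:.0f}")
```

The several thousand fringes that pass for each millimetre of travel show why electronic counting is needed for any practical distance.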

6.6.2 Beam modulation telemetry

As we mentioned above, owing to fluctuations in the density of the atmosphere, interfero- 
metric distance measuring methods are limited to distances not exceeding about 100 m. 
For greater distances, techniques involving amplitude modulation of the laser beam are 
useful. The beam from an He-Ne or GaAs laser is amplitude modulated and projected to
the 'target' whose distance is to be measured. The light reflected from the target is received
by a telescope and sent to a detector. The phase of the modulation of the reflected beam is
different from that of the emitted beam because of the finite time taken for the light to travel
to the target and return to the telescope. The phase shift φ is given by

$$ \phi = \frac{2\pi f}{c}\,(2 n_g L) \qquad (6.12) $$

where f is the modulation frequency, c is the velocity of light in vacuum, L is the target
distance and n_g is the group index of refraction of the atmosphere. The value of n_g for the
632.8 nm He-Ne laser wavelength is n_g = 1.000 28 for dry air at 15°C, 760 torr and 0.03% CO2.
Corrections for varying atmospheric temperature and pressure are available. These corrections
are difficult to apply for measurements in the field, however, and one must attempt to average
n_g over the entire path traversed by the light.

FIG. 6.17 Schematic diagram of a cube corner reflector. The incident light reflects once off
each side face and emerges along a path almost exactly parallel to that of the incident light.

Figure 6.18 shows the schematic diagram of a beam modulation system. The light is amplitude
modulated at a given frequency f, collimated and transmitted to the target. Reflected
light is collected by the telescope and focused onto the detector (the presence of a retrore- 
flector on the target is a great help). A phase detector compares the relative phase of the 
reflected beam with that of the original beam. 

The phase difference can be written as

$$ \phi = (p + q)\,2\pi $$

where p is an unknown integer and q is a fraction less than unity. The phase comparison
gives q but not p. To find p, the measurement must be repeated with different values of the
modulation frequency. Having found φ then L can be determined from eq. (6.12). The narrow
bandwidth of the laser light enables high discrimination against stray light so that the system
can be used in daylight with a high signal-to-noise ratio, while the small beam divergence
allows a high degree of selectivity in the target being examined.

FIG. 6.18 Schematic diagram of a beam modulation distance measurement system.

Beam modulation telemetry units have been developed for which the accuracy is better
than 1 mm at distances up to 1000 m and 1 part in 10^6 for greater distances. Such devices
have been used for the measurement of large structures such as dams and bridges, and as
airborne instruments for land profiling and for geodetic surveying.
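One way of removing the ambiguity in p is sketched below in Python. It is only an illustration under stated assumptions: eq. (6.12) is taken in the form φ = (2πf/c)(2n_g L) with f the modulation frequency, and the two-frequency scheme, the function names and the numerical values are our own hypothetical choices rather than a description of any particular instrument. A low modulation frequency is chosen so that its own integer p is zero, and the resulting coarse distance fixes p at the higher frequency.

```python
import math

C = 299_792_458.0   # velocity of light in vacuum, m/s
N_G = 1.00028       # group index of air quoted in the text for 632.8 nm

def phase_shift(distance_m, mod_freq_hz, n_g=N_G):
    """Round-trip modulation phase, assuming phi = (2*pi*f/c) * (2*n_g*L)."""
    return 2.0 * math.pi * mod_freq_hz * (2.0 * n_g * distance_m) / C

def split_phase(phi):
    """Split phi = (p + q) * 2*pi into the integer p and the fraction q."""
    cycles = phi / (2.0 * math.pi)
    p = math.floor(cycles)
    return p, cycles - p

def unambiguous_range(mod_freq_hz, n_g=N_G):
    """Distance over which the phase advances by one full cycle (p increases by 1)."""
    return C / (2.0 * n_g * mod_freq_hz)

def resolve_distance(q_fine, f_fine, q_coarse, f_coarse, n_g=N_G):
    """Hypothetical two-frequency scheme: the coarse frequency is low enough
    that its own p is zero, so its q gives a rough distance that fixes p_fine."""
    coarse_estimate = q_coarse * unambiguous_range(f_coarse, n_g)
    step = unambiguous_range(f_fine, n_g)
    p_fine = round(coarse_estimate / step - q_fine)
    return (p_fine + q_fine) * step

# Simulated measurement: the phase detector reports only the fractions q.
true_distance = 437.25              # m, illustrative value
f_fine, f_coarse = 15e6, 150e3      # Hz, illustrative modulation frequencies
_, q_fine = split_phase(phase_shift(true_distance, f_fine))
_, q_coarse = split_phase(phase_shift(true_distance, f_coarse))
recovered = resolve_distance(q_fine, f_fine, q_coarse, f_coarse)
print(f"recovered distance = {recovered:.4f} m")
```

In practice the coarse measurement need only be accurate to better than half of the fine unambiguous range (about 5 m for 15 MHz modulation) for the rounding step to select the correct p.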

6.6.3 Pulse echo techniques

We can measure large distances by timing the round trip transit time for a very short pulse
reflected from a distant target. The system consists of a pulsed laser, preferably Q-switched,
a telescope to collect the reflected light, a photodetector and an accurate timer. The narrowly
collimated beam of the laser makes it possible to measure the range to specific targets and
the technique has military applications as a range finder. Accuracy of the order of ±5 m in
10 km has been achieved.

A novel application has been in measuring the distance to the moon. Using retroreflectors
left on the surface of the moon during the Apollo 11, 14 and 15 space missions, the lunar
distance has been measured to an accuracy of ±15 cm.
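The timing requirements implied by these accuracies follow from the round trip relation, range = ct/2n, and can be estimated with the Python sketch below. The function names and the default refractive index of 1.0 are our own assumptions for illustration.

```python
C = 299_792_458.0  # velocity of light in vacuum, m/s

def range_from_round_trip(round_trip_s, n=1.0):
    """Target range from the measured round trip time of the pulse."""
    return C * round_trip_s / (2.0 * n)

def timing_resolution_for(range_error_m, n=1.0):
    """Round trip timing error that corresponds to a given range error."""
    return 2.0 * n * range_error_m / C

# Range finder: +/- 5 m in 10 km
t_10km = 2.0 * 10e3 / C
dt_5m = timing_resolution_for(5.0)

# Lunar ranging: +/- 15 cm
dt_15cm = timing_resolution_for(0.15)

print(f"round trip for 10 km   = {t_10km * 1e6:.1f} us")
print(f"timing error for 5 m   = {dt_5m * 1e9:.1f} ns")
print(f"timing error for 15 cm = {dt_15cm * 1e9:.2f} ns")
```

A range error of 15 cm corresponds to a timing error of about one nanosecond, which indicates why very short pulses and fast detectors are needed for lunar ranging.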

This technique, which is often known as optical radar or 'lidar' (light detection and
ranging), has been extended to atmospheric studies. By measuring the amount of backscattered
light, the presence of air turbulence can be detected and the amounts of various atmospheric
pollutants such as CO2 and SO2 can be measured (refs 6.12c and 6.13b).



6.7 Holography

Although holography was developed prior to the laser (the first hologram was recorded by
Gabor in 1948, ref. 6.14), the requirement of holography for light with a high degree of spatial
and temporal coherence has closely linked the development of holography to that of lasers.
Holography is a method of recording information from a three-dimensional object in such
a way that a three-dimensional image may subsequently be reconstructed; the phenomenon
is often known as wavefront reconstruction. A great deal of work has been done on holography
and its applications; a selection of texts on the subject is given in ref. 6.15.

pj ure s 6.19(a) and (b) illustrate the basic principles. A photographic plate is exposed 
. u itaneously to waves of light scattered by the 'object' and to waves of light from a 'ref- 

nce ' source. The reference beam, shown here in Fig. 6.19(a) as a plane-parallel beam, 

v be of any reproducible form and is derived from the same laser source as the light illu- 
rT1 . tin g the object. Because of their high degree of mutual coherence the two sets of waves 

duce an interference pattern on the plate, which is recorded in the photographic emul- 
So°n and forms a hologram. 
The photographic plate is now processed and illuminated with only the reference beam
present, as shown in Fig. 6.19(b). Most of the light from the reference beam passes straight
through the hologram; some of it, however, is diffracted by the interference pattern in the
emulsion. By the normal diffraction grating equation (eq. 1.28), light of wavelength λ will
FIG. 6.19 Schematic diagram showing the basic geometry for (a) making a hologram by recording the
interference pattern produced by the interference of light reflected from the object, and the reference
wavefronts, and (b) reconstruction of the object wavefront. To the observer the reconstructed wavefront
appears to be coming from the object itself and a virtual image is seen.



experience constructive interference at angles such that λ = d sin θ, where d is the local fringe
spacing of the interference fringes whose exact shape and distribution depends on the shape
of the object and the wavefronts reflected from it. Thus the constructive interference of these
diffracted waves reconstructs the original wavefronts from the object and to an observer the
wavefronts appear to be coming from the object itself. These wavefronts constitute what is
termed the virtual image. However, just as a diffraction grating gives diffracted orders on
either side of the 'straight-through' position, the hologram generates a second image: this
image, which is usually inferior in quality to the virtual image, is called the real image.

The hologram serves as a 'window' on the object scene, which has been illuminated by
the laser, through which the object can be viewed from different angles. The range of views
of the reconstructed object is limited only by the size and position of the hologram and a
truly three-dimensional effect is created.

The mathematical analysis of holography is quite complicated, but the following simplified
treatment will serve to illustrate the principles involved. We assume that the photographic
plate is in the (x, y) plane and that we may represent the electric field of the wavefront reflected
from the object in the (x, y) plane at time t by

$$\mathcal{E}_o = U_o(x, y)\exp(-\mathrm{i}\omega t)$$

where $U_o(x, y)$ is the amplitude, which in general is complex. Similarly, the complex
amplitude of the reference beam in the (x, y) plane at the same instant is $U_r(x, y)$. As the object
and reference beams are coherent, the irradiance recorded on the photographic plate is given
by adding the amplitudes and multiplying by the complex conjugate. Thus the irradiance
is

$$I(x, y) = |U_o + U_r|^2 = (U_o + U_r)(U_o^* + U_r^*)$$

Therefore

$$I(x, y) = (U_o U_o^* + U_r U_r^*) + (U_r U_o^* + U_o U_r^*) \qquad (6.13)$$

The first term in eq. (6.13) is the sum of the individual irradiances; the second term
represents the interference which occurs and thus contains information in the form of amplitude
and phase modulations of the reference beam.
The plate is now processed to form a transmission hologram. With correct processing,
the transmission of the hologram is a constant (say T) times the irradiance function I(x, y)
given in eq. (6.13). If the plate is now illuminated by the original reference beam only, the
transmitted light will have a complex amplitude $U_T$ which is proportional to $U_r$ times the
transmittance of the hologram T I(x, y). Hence we may write

$$U_T(x, y) = U_r\,T\,I(x, y)$$

$$= T\left[U_r(U_o U_o^* + U_r U_r^*) + U_r^2 U_o^* + U_r U_r^* U_o\right] \qquad (6.14)$$

As we mentioned above, the hologram behaves like a diffraction grating and produces a direct
beam and a first-order diffracted beam on either side of the direct beam. The first term in
eq. (6.14) represents the direct beam. The last term is the one of greatest interest; $U_r U_r^*$ is
constant so that the last term is essentially $U_o$, the object wavefront amplitude. Hence this
diffracted beam represents a reconstruction of the wavefront from the original object and it
forms the virtual image. The middle term represents the other diffracted beam and forms
the real (conjugate) image.

This description can be verified by considering the simple case of an object comprising
a single white line on a dark background. The hologram, in this case, turns out to be a simple
periodic (or sine) grating. The zero order of the diffracted light is the direct beam, whereas
the first orders on either side comprise the virtual and real images.
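
The algebra of the simplified treatment can be checked numerically. The sketch below is only an illustration under stated assumptions: the object and reference beams are both taken to be plane waves (the simplest analogue of the single-line object), and the wavelength, beam angle, amplitudes and function names are all hypothetical choices rather than values from the text. It shows that the recorded irradiance is a sinusoidal grating whose spacing satisfies λ = d sin θ, and that re-illumination by the reference beam yields a term proportional to the object wave.

```python
import cmath
import math

WAVELENGTH = 633e-9           # assumed He-Ne wavelength, m
THETA = math.radians(10.0)    # assumed angle between the two beams
K = 2.0 * math.pi / WAVELENGTH

def u_ref(x):
    """Reference beam U_r: unit-amplitude plane wave at normal incidence."""
    return complex(1.0, 0.0)

def u_obj(x):
    """Object beam U_o: plane wave of amplitude 0.5 inclined at THETA."""
    return 0.5 * cmath.exp(1j * K * math.sin(THETA) * x)

def irradiance(x):
    """I(x) = |U_o + U_r|^2, the quantity recorded on the plate."""
    total = u_obj(x) + u_ref(x)
    return (total * total.conjugate()).real

def transmitted_terms(x, T=1.0):
    """Direct, real-image and virtual-image terms on re-illumination by U_r."""
    uo, ur = u_obj(x), u_ref(x)
    direct = T * ur * (abs(uo) ** 2 + abs(ur) ** 2)
    real_image = T * ur * ur * uo.conjugate()
    virtual_image = T * ur * ur.conjugate() * uo
    return direct, real_image, virtual_image

# Fringe spacing from the grating equation, lambda = d sin(theta)
fringe_spacing = WAVELENGTH / math.sin(THETA)

print(f"fringe spacing: {fringe_spacing * 1e6:.2f} um")
for fraction in (0.0, 0.25, 0.5, 1.0):
    x = fraction * fringe_spacing
    print(f"x = {fraction:.2f} d  ->  I = {irradiance(x):.3f}")
```

With these amplitudes the irradiance swings between a maximum at x = 0 and a minimum half a fringe spacing away, and repeats with period d, as expected for a sine grating.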

6.7.1 Applications of holography 

Undoubtedly, the full potential of the holographic technique is still to be realized although 
a number of applications are now firmly established (ref. 6.16). We shall describe one estab- 
lished and one potential application, namely holographic interferometry and computer mem- 
ories respectively. 

6.7.1.1 Holographic interferometry 

The determination of surface contours by conventional interferometry has been restricted 
to the examination of reflecting surfaces with simple shapes. This restriction is removed by 
holographic interferometry, which can be used for complicated shapes with diffusely reflect- 
ing surfaces. There are a number of recognizably different types of holographic interferometry 
which we now describe briefly. 

Double exposure holographic interferometry is an important industrial process in which 
very small displacements or distortions of an object can be measured by counting interfer- 
ence fringes. The subject of the investigation is recorded as a hologram and before the plate 
is processed the subject is moved, distorted by stress or whatever, and a second hologram 
recorded. After processing, each image can be reconstructed in the usual way. The two sets 
of wavefronts for the reconstructed images interfere and produce interference fringes over 
the full range of views of the subject obtainable through the hologram. A typical example 
of this technique is given in Fig. 6.20 which shows a circular membrane which has been 
deformed by a uniform pressure. The time between the two records may be anything from 
a fraction of a microsecond upwards, but the plate and subject must maintain the same re- 
lative positions except for the movement under investigation. 
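
Counting fringes can be turned into a displacement estimate. The sketch below uses the standard result, which is not derived in the text, that for an out-of-plane displacement each successive fringe corresponds to a displacement of λ/(cos θ_i + cos θ_v), where θ_i and θ_v are the illumination and viewing angles measured from the surface normal; this reduces to λ/2 per fringe when both are along the normal. The function name and the example wavelength are assumptions.

```python
import math

def displacement_from_fringes(n_fringes, wavelength_m,
                              illum_angle_deg=0.0, view_angle_deg=0.0):
    """Out-of-plane displacement corresponding to a counted number of fringes."""
    sensitivity = (math.cos(math.radians(illum_angle_deg))
                   + math.cos(math.radians(view_angle_deg)))
    return n_fringes * wavelength_m / sensitivity

# Example: 10 fringes counted at an assumed He-Ne wavelength of 633 nm,
# with illumination and viewing both along the surface normal.
shift = displacement_from_fringes(10, 633e-9)
print(f"displacement: {shift * 1e6:.3f} um")
```

This makes clear why the method is sensitive to very small distortions: each fringe represents a displacement of only a fraction of a micrometre.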

A variation of the technique is real time holographic interferometry, in which the inter- 
ference fringes are viewed in real time. A hologram of the subject is recorded as above but 
in this case the plate is processed and replaced in its original place. The subject is now dis- 
torted and interference fringes can be observed through the holographic plate, changing as 
the distortion of the subject actually occurs. Although real time holographic interferometry 
provides a sensitive tool for measuring the strains of objects as they actually deform, it suffers 
from a number of problems. These include the difficulty of replacing the plate exactly in its 
original place and distortion of the photographic emulsion during processing. Figure 6.21 
shows the fringe pattern in real time holographic interferometry as the object, a metal bar 
which is clamped at one end, is stressed. 

The third technique, time-average holographic interferometry, is particularly useful for 
examining the spatial characteristics of low amplitude vibrations of an object. In most holo- 
graphic situations, a general rule is that the object should remain stationary during the period 
of exposure. In the present case this rule is violated dramatically, for during the exposure 
the object is moving continuously. The resulting hologram may be regarded as the limiting 
case of a large number of exposures for many different positions of the surface. The fringes 
FIG. 6.20 Double exposure holographic interferogram showing the deformation of a circular membrane
which has been caused by a uniform pressure. (Photograph courtesy of ... University of Michigan.)




FIG. 6.21 Illustration of real time or single exposure holographic interferometry. Interference of the
object wave with the reconstructed object wave shows the deformation of the object. That the fringes
are not horizontal indicates that the bar has suffered a twist in addition to bending. (From ...)
produced represent contour lines of equal amplitude of vibration of the surface. The brightest
fringes occur at the nodes where the surface remains stationary. Elsewhere, providing
the period of exposure covers many vibrations of the surface, there is a variation in irradiance
due to the surface motion, with almost zero irradiance at the antinodes. An example
of the application of time-average holographic interferometry to the analysis of the vibrations
of a turbine blade is shown in Fig. 6.22.

6.7.2 Holographic computer memories 

Holographic computer memories are being actively developed because, potentially, they have 
a very high storage capacity-theoretically up to 10 1 " bits mirf 3 -with rapid access. Though 
they are unlikely to replace established technologies such as magnetic disk, tape or optical 
disk storage, they could be used where very large total storage capacity is required. 
Holographic storage systems are also interesting in that they involve several of the
optoelectronic devices we have described, such as lasers, optical modulators and photodetectors,
as well as optical storage media.

A holographic memory records and reads out a large number of bits simultaneously, as
we can appreciate by considering Fig. 6.23. The information to be stored is formed into a
two-dimensional array of bits by a device called a page composer. The page composer may
be thought of as an array of light valves which may be open or closed corresponding to 'ones'
and 'zeros' respectively. This array of up to perhaps 10 s - 10" bits is then stored at one time
in a particular location of the holographic memory. During recording, the light modulators
allow maximum irradiance in both the signal and reference beams. To store a different array
as another hologram on the storage medium the beam deflector moves the beams to the
appropriate position.

FIG. 6.23 Schematic diagram of a holographic memory

Readout of data occurs when the hologram is addressed only by the reference beam. The
first modulator is partly closed to reduce the irradiance of the light reaching the holographic
storage medium, while the second modulator is closed to cut off the signal beam. The deflector
directs the beam to the hologram to be read via the tracking mirror and an image representing
the arrays of 'ones' and 'zeros' is produced. This image is focused by a lens and
projected onto the detector array, such as a charge-coupled device (CCD) array, which is
available with 2048 × 2048 detector pixels (see section 7.3.7). Each bit originally formed by
the page composer is incident on one photodetector in the array. The stored information is
thus converted into electrical signals.

The storage of data in this way offers a number of advantages compared to a bit-orientated
memory. The information about a particular array is distributed in a holographic fringe pattern
across the entire hologram. Thus the hologram is not sensitive to small scratches or particles
of dust which might otherwise cause the loss of bits of data in a bit-orientated memory.

Secondly the information is recovered essentially in parallel. A large number of bits are
all read simultaneously, allowing a very high readout rate. The requirements on light beam
deflection are also reduced. Each position to which the beam is deflected represents a large
number of bits of data; thus a ^ bit memory requires only 10* separate locations, well within
the capabilities of present light beam deflectors. Addressing can be done using electro-optical
deflectors which have access times in the region of microseconds.

A third advantage is that the holographic recording and reconstruction is insensitive to the
exact position of the reference or reading beam on the hologram. The hologram can be
moved but the focused spots remain stationary. This means that holographic memory 
systems are easier to align and less subject to problems of vibration than other optical 
memories. 

Finally it is possible to record several holograms in different planes of one thick, sensi- 
tive material provided that different reference beam directions are used to record each holo- 
gram. The appropriate hologram is then read with a readout beam aligned at precisely the 
same angle as the original reference beam in the recording process. 

Despite these advantages, the availability of suitable storage media has to date limited 
the development of holographic memories. Although photographic emulsion is satisfactory, 
it provides a permanent store only, which cannot be easily updated. Ideally the storage 
medium should have low write energy and high readout efficiency, be stable, but alterable 
so that data can be erased and new data written in. Potentially useful read/write/erase
optically sensitive materials include thermoplastic photoconducting layers, magneto-optic MnBi
and electrophotochromic KCl. Among the most attractive classes of recording materials are
photorefractives (ref. 6.17), such as iron-doped lithium niobate (LiNbO₃:Fe), in which the
irradiance of the incident light modulates the refractive index. A test system (ref. 6.18) using
LiNbO₃:Fe has stored 450 pages of 320 × 240 pixels with effectively a zero bit-error rate.
The storage density approaches 10⁸ bits cm⁻³.
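
The capacity of the test system quoted above follows from simple arithmetic. In the sketch below the page count and page dimensions are those given in the text; the assumption that each pixel stores exactly one bit, and the function name, are ours.

```python
def total_bits(pages, rows, cols, bits_per_pixel=1):
    """Total stored bits for a page-organized holographic memory."""
    return pages * rows * cols * bits_per_pixel

pages, rows, cols = 450, 320, 240      # figures quoted for the test system
bits_per_page = rows * cols
capacity_bits = total_bits(pages, rows, cols)

print(f"bits per page: {bits_per_page}")
print(f"total capacity: {capacity_bits} bits "
      f"({capacity_bits / 8 / 1e6:.2f} Mbyte)")
```

Each page therefore carries 76 800 bits, all of which are read in parallel, and the complete store holds about 3.5 × 10⁷ bits.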



6.8 High energy applications of lasers

For several years now lasers have been used to perform a large number of materials pro- 
cessing functions such as cutting, drilling, welding, marking (or scribing) and the surface 
treatment of a wide range of materials. These include hard materials, for example diamond 
and ceramics, metals, wood, soft and pliable materials, and biological tissues. These processes 
have, of course, often been successfully undertaken without the need to resort to relatively 
advanced and expensive laser technology. Lasers, however, not only have significant advan- 
tages over conventional methods even for quite standard applications, but also may enable 
tasks to be undertaken which conventional techniques cannot accomplish. 

The advantages of lasers, when compared with more conventional techniques for mate- 
rials processing, include: 

1. Laser radiation is a very 'clean' form of energy, in that no contaminating impurities need 
come into contact with the workpiece. In fact the working atmosphere can often be con- 
trolled to suit a particular task. Furthermore the radiation can be directed through a 
window in a vacuum chamber to provide the ultimate in contamination-free processing. 

2. As we have seen laser beams, because of their high spatial coherence, may be focused 
into very small areas. Thus intense local heating can take place with limited effect on 
neighbouring areas. 
3. It is comparatively easy to control the beam irradiances, and hence the energy delivered
to the workpiece.

4. The beam can be readily directed into relatively inaccessible locations, and steered round
sharp corners.

5. Most of the laser energy is deposited very near the surface of the workpiece (though there
are exceptions - see section 2.8.1), thus enabling shallow regions to be treated without
necessarily affecting the bulk.

It is probably true to say that the two lasers most commonly used for materials processing
are the CO₂ and Nd:YAG lasers. This situation will perhaps change in the future, however,
as other lasers become available which are less expensive and offer different characteristics
such as emission wavelength and output power. Comparing Nd:YAG and CO₂ lasers,
the latter is the more versatile and is commercially available with a wide range of power
outputs, up to several tens of kilowatts, at reasonable cost. There are some applications for
which the Nd:YAG laser has advantages because of its shorter emission wavelength. For
the same reason excimer lasers are increasingly being used in semiconductor device
processing, where the shorter, blue/ultraviolet wavelengths enable smaller device features to
be delineated via photolithography.

One of the most important aspects of laser processing is that of beam delivery to the
workpiece. The success of a laser operation will often depend on the ability of the beam delivery
system to position a focused spot of radiation precisely on the workpiece. In turn the
size of the spot, which is often of crucial importance, depends on the beam quality in terms
of the spatial power density distribution, stability and number of oscillating modes.
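
The dependence of spot size on wavelength and beam quality can be illustrated with the usual Gaussian-beam estimate d ≈ 4M²λf/(πD), where f is the focal length of the delivery lens, D the beam diameter at the lens and M² a beam quality factor (unity for an ideal single-mode beam). This formula and the lens parameters below are assumptions introduced for illustration; they are not taken from the text.

```python
import math

def focused_spot_diameter(wavelength_m, focal_length_m, beam_diameter_m,
                          m_squared=1.0):
    """Approximate focused spot diameter of a Gaussian-like beam."""
    return (4.0 * m_squared * wavelength_m * focal_length_m
            / (math.pi * beam_diameter_m))

F = 0.1     # assumed focal length of the delivery lens, m
D = 0.01    # assumed beam diameter at the lens, m

co2_spot = focused_spot_diameter(10.6e-6, F, D)      # CO2 laser, 10.6 um
ndyag_spot = focused_spot_diameter(1.064e-6, F, D)   # Nd:YAG laser, 1.064 um

print(f"CO2 spot:    {co2_spot * 1e6:.1f} um")
print(f"Nd:YAG spot: {ndyag_spot * 1e6:.1f} um")
print(f"multimode CO2 (M^2 = 3): "
      f"{focused_spot_diameter(10.6e-6, F, D, 3.0) * 1e6:.1f} um")
```

For the same optics the spot diameter scales directly with wavelength, so the Nd:YAG spot is roughly ten times smaller than the CO₂ spot, while a multimode beam gives a proportionately larger spot.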

Figure 6.24 shows a basic beam delivery system. It is, of course, not always possible to
place the workpiece on a horizontal or other well-defined surface; indeed for many
applications the beam must be steered into rather inaccessible locations. Until recently beam
steering was accomplished by articulated arms such as that illustrated in Fig. 6.25. Such
systems, for CO₂ lasers for example, include a set of gold mirrors for reflection through
as many as seven articulated joints. Though such systems are resistant to damage when used
with high power lasers there are obvious limitations, including the separation of the laser
and workpiece, arm flexibility and focusing accuracy, and the detrimental impact of the
optical components on beam quality.

FIG. 6.24 Schematic layout of a basic laser beam delivery system.

Increasingly, beam delivery systems use optical fibers, which, though of somewhat greater
diameter than those described in Chapter 8, behave in essentially the same way. In practice
the maximum diameter for silica fibers is 600 µm, as beyond that the fiber is rather inflexible.
While silica-based fibers similar to those used for fiber optic communications can be
used for Nd:YAG and other near-infrared and visible output lasers, they cannot be used for
CO₂ lasers where the emission wavelength of 10.6 µm is completely absorbed. Significant
effort has been expended on the development of suitable fibers for transmitting laser
radiation in the range 2-10 µm. Several materials have been developed for use, including
zirconium fluoride, sapphire, germanium, chalcogenide glasses and silver halide as well as
hollow waveguides. Of these, chalcogenide glasses are quite effective for CO₂ lasers, while
zirconium fluoride is preferred for near-infrared radiation.

In addition to improved flexibility and increased laser to workpiece separation, the use 
of fibers often leads to an improvement in the beam quality as it is delivered to the work- 
piece and consequently to more efficient processing (ref. 6.19). Laser beams, which may 
be multimode and have an asymmetric cross-section tend to be smoothed out as they prop- 
agate along the fiber thereby enabling a smaller focused spot to be obtained than otherwise 
would be the case. In fact the use of phase conjugate mirrors (PCMs) in non-linear optical 
phase conjugate (NOPC) systems leads to an output from the delivery fiber with a very high 
beam quality (ref. 6.20). 

Another aspect of fiber delivery systems which has particularly revolutionised Nd:YAG,
and other near-infrared, laser processing is the ability simultaneously to view the workpiece
during processing. Many applications require very careful control of the focus position on
the workpiece, which may require manipulation of the beam by an operator or machine vision
system. The only way of guaranteeing the required accuracy is to use the beam delivery optics
to view the workpiece. This can be achieved for visible and near-infrared wavelengths but
not for CO₂ laser radiation, where the optics are either opaque to visible light or give a focus
at a significant distance from that of the 10.6 µm radiation. Finally the use of fibers enables
the radiation from a single laser to be split and shared between several workstations.

FIG. 6.25 Schematic diagram of an articulated-arm beam delivery system for use with CO₂ lasers in
surgery - as many as seven joints may be included in the arm.

6.8.1 Industrial applications 

A detailed theoretical analysis of the interaction between a laser beam and a material surface
is extremely difficult to carry out. The material parameters which must be taken into account
include the reflectance of the surface, thermal conductivity, specific heat capacity, latent heats
of fusion and vaporization, melting and boiling points, and absorption of the laser radiation
by the material vapour. Nevertheless the theory enables order-of-magnitude calculations
to be undertaken to estimate quantities such as the temperature rise and rate of removal of
material for given laser power inputs. The calculations need to take into account whether
the laser is operating in CW or in pulse mode, in which case the pulse repetition frequency
is clearly important (ref. 6.21).
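
An order-of-magnitude estimate of the kind referred to above can be sketched as follows. The calculation simply adds the heat needed to raise the material to its boiling point to the latent heats of fusion and vaporization; it ignores conduction losses and vapour absorption, and uses a single specific heat for solid and liquid. The material values are approximate figures for iron and, like the hole dimensions, laser power and reflectance, are assumptions for illustration only.

```python
import math

def energy_to_vaporize(mass_kg, specific_heat, t_initial_k, t_boil_k,
                       latent_fusion, latent_vaporization):
    """Energy (J) to heat a mass to its boiling point, melt and vaporize it."""
    heating = specific_heat * (t_boil_k - t_initial_k)
    return mass_kg * (heating + latent_fusion + latent_vaporization)

# Approximate values for iron (assumed, for illustration)
DENSITY = 7870.0        # kg m^-3
SPECIFIC_HEAT = 450.0   # J kg^-1 K^-1
T_BOIL = 3134.0         # K
L_FUSION = 2.47e5       # J kg^-1
L_VAPOUR = 6.09e6       # J kg^-1

# Hypothetical hole: 0.5 mm diameter, 1 mm deep
volume = math.pi * (0.25e-3) ** 2 * 1.0e-3
mass = DENSITY * volume
energy = energy_to_vaporize(mass, SPECIFIC_HEAT, 293.0, T_BOIL,
                            L_FUSION, L_VAPOUR)

absorbed_power = 500.0 * (1.0 - 0.6)   # assumed 500 W beam, reflectance 0.6
print(f"energy required: {energy:.1f} J")
print(f"minimum time at {absorbed_power:.0f} W absorbed: "
      f"{energy / absorbed_power * 1e3:.0f} ms")
```

On these assumptions the energy required is of the order of 10 J, most of it accounted for by the latent heat of vaporization.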

Given the limitations of space we shall consider, rather briefly, only three industrial
applications, namely welding, cutting and drilling (or hole piercing); others are described in
ref. 6.21.

In the basic welding process two metals, which may be the same or dissimilar, are placed
in contact, and the region around the contact heated until the materials melt and fuse together.
Careful control is required to ensure that sufficient heat is supplied to melt a sufficient volume
of material, but not enough to give rise to significant vaporization of the material, which
can lead to weak, porous welds. One of the problems is that the reflectance of most metals
decreases dramatically as the temperature approaches the melting point, requiring further
careful control of the incident energy.

Laser welding has to compete with many well-established techniques such as soldering,
arc welding, resistance welding and electron beam welding. Laser welding has, however, a
number of advantages, including:

1. Minimum heat input, which results in very little distortion of the workpiece.

2. Heating is localized and cooling is rapid so the neighbouring heat-affected zone is small.

3. There is no physical contact with external components.

4. Dissimilar materials can be welded, which is often difficult using other techniques.

5. The process can be easily automated.

6. Faster weld rates can be obtained than from other techniques.

7. The high quality of the resulting welds.

Welding is normally carried out using a shielding inert gas such as argon or helium (as shown
in Fig. 6.26) to cover the weld area and prevent oxidation of the metals, which results in
poor welds. The gas also helps to remove any metal vapour which may be created and which
may be deposited on the focusing lens. Metal vapours can also be highly absorbing, and in
some cases can prevent most of the laser radiation from reaching the workpiece.

FIG. 6.26 Schematic beam focusing head design for laser welding when using a shielding gas.

Both CW and pulse lasers are used for welding. For example, continuous seam welds 
are performed almost exclusively with CW lasers operating at 500 W or greater. While CW 
lasers produce continuous welds, pulse lasers produce a train of spot welds as the beam 
moves over the workpiece. The separation of the spot welds depends, of course, on the pulse 
repetition frequency and scan speed so that they may overlap and produce a quasi-contin- 
uous weld. Clearly in situations where only a single or small number of spot welds are 
required, a pulse laser is entirely appropriate, in which case Nd:YAG or Nd:glass lasers 
are often used. 
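
The relation between pulse repetition frequency, scan speed and spot overlap can be made explicit. In the sketch below, which uses hypothetical function names and example values, the centre-to-centre spacing of the spot welds is the scan speed divided by the pulse repetition frequency, and the welds overlap whenever this spacing is less than the spot diameter.

```python
def spot_spacing(scan_speed_m_per_s, pulse_rate_hz):
    """Centre-to-centre distance between successive spot welds."""
    return scan_speed_m_per_s / pulse_rate_hz

def overlap_fraction(scan_speed_m_per_s, pulse_rate_hz, spot_diameter_m):
    """Fraction of the spot diameter shared by neighbouring welds (0 if none)."""
    spacing = spot_spacing(scan_speed_m_per_s, pulse_rate_hz)
    return max(0.0, 1.0 - spacing / spot_diameter_m)

# Assumed example: 10 mm/s scan speed, 50 Hz pulse rate, 0.5 mm weld spots
speed, rate, spot = 10e-3, 50.0, 0.5e-3
print(f"spacing: {spot_spacing(speed, rate) * 1e3:.2f} mm")
print(f"overlap: {overlap_fraction(speed, rate, spot) * 100:.0f} %")
```

Raising the scan speed or lowering the pulse rate in this example eventually reduces the overlap to zero, giving a train of separate spot welds rather than a quasi-continuous seam.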

When using multikilowatt CO₂ or high energy pulse mode lasers the welding process
becomes rather more complicated. In these situations, when the laser beam first strikes the
surface a significant amount of material may be vaporized, forming a small hole known as
a keyhole (Fig. 6.27). Laser energy which subsequently enters this hole is trapped and is
carried deeper into the material than otherwise would be the case. The material around the
keyhole then melts and fills in the hole and later solidifies to form the weld as the laser scans
across the surface. Were it not for this phenomenon weld depths would be limited to only
about a millimetre or so. With keyholing, however, weld depths of several tens of millimetres
have been achieved with high power CO₂ lasers, thereby facilitating the fabrication of large
structures.

At the other end of the scale lasers are also used in microelectronics to weld minute elec- 
trical contacts, and to weld small devices accurately into place. Again the advantages of being 
able to deposit exactly the right amount of heat precisely to the required areas are extremely 
important. 
FIG. 6.27 Formation of a 'keyhole' during high power laser welding.

Industrial laser cutting uses very much the same range of lasers as is used in welding and
the advantages of laser cutting over conventional techniques are similar to those of welding
and essentially independent of the type of laser used. In cutting the aim is to vaporize the
material as quickly as possible to produce a heat-affected zone (HAZ) which is as narrow
as possible. Laser cutting is a gas-assisted process in which a gas, under pressure, forces
molten material from the region of the cut or kerf. In the case of reactive materials such as
metals oxygen is normally used; this enhances the cutting rate and quality of the cut edges,
which are left with a thin oxide coating. In some circumstances, for example if the metal
edge is to be subsequently welded, the oxide layer is undesirable and an inert gas must be
used. Similarly when non-metallic materials such as ceramic, wood, paper or plastics are
to be cut, oxidation is better avoided and again an inert gas is employed.

The gas stream, as in welding, is usually delivered co-axially with the laser beam and it 
is important to maintain control over the pressure and flow rate during cutting. In many cases 
carefully shaped nozzles have been designed to optimize the cutting process. 

Thicknesses of up to about 5 mm of steel can be readily cut with 500 W CO₂ lasers and
even greater thicknesses can be dealt with using higher power lasers. The faces of the cut,
or kerf, in comparatively thick metals and non-metals are often surprisingly straight sided.
This results from a light-guiding effect due to multiple reflections of the laser beam from
the sides of the kerf as illustrated in Fig. 6.28. The amount of remaining taper can be
controlled to a certain extent by the positioning of the focal point of the delivery lens. In metal
cutting it is best to focus on the surface of the workpiece, while for non-metals better results
are obtained if the focus is below the surface.

Laser cutting systems are frequently essentially numerically controlled systems or robots.
In the case of cutting sheet metal, for example, a two-axis numerical controller is adequate.
FIG. 6.28 Sketch of the kerf produced by laser cutting. The laser beam is channelled through the material
by multiple reflections from the walls producing a waveguiding effect. A similar effect occurs in laser
drilling.



In more complicated processing, however, four- or five-axis numerical control with auto- 
matic location sensing and feedback control of the focused beam may be required. The flex- 
ibility of such systems is one of the main advantages of laser cutting. For example, where 
several 'options' in a processed part are required, this can be achieved by appropriate pro- 
gramming of the system, without the need for frequent tooling changes. Such advantages 
have led to the increasing use of laser processing systems in the automotive industry.

Scribing and marking are similar to cutting, though the depth of the cut is very much 
reduced. In scribing a line of weakness is created in the surface of a material prior to break- 
ing it along the line. This technique is widely used in the semiconductor industry where, for 
example, processed silicon wafers are scribed prior to breaking them into individual chips. 
Similarly laser marking is a convenient way of engraving identification numbers, symbols 
or logos onto a variety of material objects. 

Hole piercing or drilling was one of the first industrial uses of lasers, when in 1965 a
diamond die for wire drawing was drilled with a ruby laser in about one-hundredth of the
time taken by conventional means. Since then lasers have been used to drill an amazingly
wide range of materials including cigarette filter paper, plastic and rubber baby bottle nipples,
aerosol can nozzles, ceramics and glasses, and titanium alloy parts used in turbine blades.
Again the basic advantages of lasers for drilling are essentially the same as for welding and
cutting. It is a technique in which highly reproducible holes can be made at extremely high
rates, frequently in materials which are otherwise difficult to machine, without any tool
contact.

Hole dimensions are described in terms of the aspect ratio, that is the ratio of the hole
depth to hole diameter, and taper, which is the ratio of the entrance diameter to the
mid-section diameter. As with cutting, the sides of the hole are straighter than we may have
anticipated because of the waveguiding effects of the sides of the hole (again refer to Fig. 6.28).
This effect is more apparent in glass-like materials, where aspect ratios of 2^ may be
achieved, whereas in metals the figure is nearer 12. As might be expected, it is found that
higher aspect ratios are favoured by slowly converging laser beams.

In most drilling operations the surface temperature of the target material is raised above
the boiling point of the material. A little thought reveals that, for a given amount of heat
energy delivered to the surface, the shorter the pulse duration the better. During longer pulses
the heat deposited has more time to diffuse into the material, resulting in a relatively larger
volume of material being heated to a lower temperature. We therefore expect pulse lasers
to be more effective than CW ones, assuming comparable average powers. For hole drilling
in metals the CO₂ laser suffers from the problem of poor initial energy absorption and
Nd:YAG lasers are preferred. For non-metals the situation is reversed. Excimer lasers are
effective in drilling very clean holes in polymeric and other organic materials because of a
non-thermal interaction of the ultraviolet radiation produced with the material.
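
The argument that shorter pulses confine the heat can be put on a rough quantitative footing using the thermal diffusion length, commonly estimated as L ≈ 2√(αt), where α = k/(ρc) is the thermal diffusivity and t the pulse duration. This estimate is a standard result rather than one given in the text, and the material values (approximately those of iron) and pulse durations below are assumptions for illustration.

```python
import math

def thermal_diffusivity(conductivity, density, specific_heat):
    """Thermal diffusivity alpha = k / (rho * c), in m^2 s^-1."""
    return conductivity / (density * specific_heat)

def diffusion_length(alpha, pulse_duration_s):
    """Approximate depth to which heat diffuses during one pulse."""
    return 2.0 * math.sqrt(alpha * pulse_duration_s)

# Approximate values for iron (assumed)
alpha = thermal_diffusivity(80.0, 7870.0, 450.0)

for duration in (1e-3, 1e-6, 1e-8):   # 1 ms, 1 us and 10 ns pulses
    depth = diffusion_length(alpha, duration)
    print(f"pulse {duration:.0e} s -> diffusion length {depth * 1e6:.2f} um")
```

Because the diffusion length varies as the square root of the pulse duration, shortening the pulse by a factor of one hundred reduces the heated depth tenfold, so the same energy is concentrated in a much smaller volume.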

6.8.2 Medical applications 

One of the most rapidly developing areas of laser applications is perhaps in medicine. Many
medical applications arise directly from the laser's materials processing capabilities
described in the previous section. Similarly many of the advantages of lasers over
conventional techniques arise from the precision and controllability, which minimize
disruption and damage to the patient, and the possibility of delivering the laser radiation
to somewhat inaccessible regions of the body. Again, because of the limitations of space
we shall confine ourselves to brief descriptions of a small number of representative
applications; for others see ref. 6.22.

One of the first, and still one of the commonest, applications is the use of CO₂ lasers in
general surgery. The 10.6 µm emission wavelength is strongly absorbed by water molecules
present in tissue, and the subsequent evaporation of the water leads to the physical removal
of the tissue (the resulting debris is usually removed by a vacuum pump). The advantages
over physical cutting include the limited damage to adjacent tissue and the cauterizing effect
of the radiation on the blood vessels, which reduces bleeding. A recently introduced
alternative to CO₂ lasers for surgery and related applications is the erbium laser, that is Er:YAG.
The erbium laser emits at a wavelength of 2.94 µm, which is also strongly absorbed by water
molecules, and the laser is consequently particularly useful for precise, localized tissue
removal or ablation. Commercially available optical fibers transmit radiation more efficiently
at 2.94 µm than at 10.6 µm, and it is likely that zirconium fluoride fibers will replace
articulated arms at this latter wavelength for many applications in the near future (ref. 6.23).
Erbium lasers are also beginning to compete with other lasers in dentistry and especially
ophthalmology, which is an area where lasers have a number of applications.

One of the earliest applications in ophthalmology was in the treatment of detached retinas 
using light from a ruby laser, which with a wavelength of 694 nm readily passes through 
the cornea and other transparent regions, to be absorbed by the red blood cells at the back 
of the eye. A pulse of radiation causes a lesion and when this heals the scar tissue formed 
reattaches the retina (in effect a spot weld). Similarly laser radiation absorbed at the back 
of the eye can be used to treat degenerative conditions associated with diabetes, which can 
lead to blindness. 

The hole drilling capability of lasers has been used to drill small holes (or fistulas) of about 
300 µm diameter through the sclera to reduce the increased intraocular pressure in the eye
resulting from glaucoma. Finally, another application is what is often referred to as corneal
sculpting or photorefractive keratectomy (PRK), in which the shape of the cornea is modified
by removing (ablating) thin layers, some 50 µm in thickness, from appropriate regions.
This changes the curvature of the cornea so that corrections for myopia (near-sightedness)
and hypermetropia (far-sightedness) can be effected. ArF excimer lasers (λ = 193 nm) are
probably most frequently used for these increasingly common operations, as the emission 
wavelength is very effective in ablating corneal tissue. However, high frequency harmon- 
ics of various solid lasers have also been used. 

Lasers can also be used in the treatment of cancer, particularly in otherwise inaccessible 
parts of the body, such as the larynx and neck of the womb. Indeed laser radiation has been 
used very successfully for several years in the treatment of the early stages of cervical cancer. 
A particularly promising technique for the treatment of tumours is photodynamic therapy. 
In this treatment, the patient is injected with a specially designed dye substance called HpD 
which accumulates in cancerous tissue, but which is rejected by healthy tissue. When exposed 
to laser radiation at a wavelength of about 630 nm, the HpD undergoes a series of photo- 
chemical reactions resulting in the formation of a chemical which kills the cancerous tissue. 

Lastly, in some applications the laser radiation is transmitted down an extremely fine 
optical fiber, which can be introduced into arteries, for example, using catheters. Radiation 
can then be delivered to remote parts of the body as in the removal of deposits of plaque, a 
fatty material, which reduces the flow of blood along the coronary artery. To ensure accu- 
rate positioning of the laser beam, and for example to prevent accidentally creating a hole 
in the artery, a second fiber comprising a coherent fiber bundle (section 10.2.1) or endoscope
is included so that the surgeon has a view of the process, as illustrated in Fig. 6.29. Similar 
techniques can be used to remove other obstructions such as blood clots. 

With the majority of laser treatments the side effects are minimal compared with con- 
ventional techniques, and patients suffer significantly less discomfort, and can often be treated 
as outpatients, which is a major factor in terms of cost and convenience. 

6.8.3 Laser-induced nuclear fusion 

For many years, research has been directed towards a system for producing controlled
thermonuclear reactions to generate energy. Nuclear fusion of light elements occurs within a
very high temperature plasma such as exists in the sun. Until recently, laboratory experiments
aimed at reproducing these conditions were based entirely on magnetic confinement
of the plasma, and these have advanced considerably in recent years using the tokamak
concept. Since the early 1970s, however, with the advent of very high power lasers an alternative
way of producing suitable conditions has been under investigation, namely inertial
confinement fusion.

FIG. 6.29 Removal of arterial plaque using laser radiation transmitted down a fine optical fiber inserted
into the artery. A viewing fiber bundle is also included in the 'tube' inserted into the artery.

The basic concept simply involves focusing very high power laser radiation onto a target.
In the early research the target comprised glass pellets approximately 50 µm in diameter
containing a mixture of deuterium and tritium gases at high pressures or pellets of frozen heavy
water (D₂O) and extra heavy water (T₂O). More recently, however, carbon-based plastic
microspheres with a 500 µm diameter and extremely thin walls ($2 \times 10^{-3}$ µm) have been
fabricated to hold the gas mixture (ref. 6.24). A number of laser beams are directed onto the
target simultaneously from symmetrically arrayed directions. Absorption of the laser radiation
at the surface of the pellet causes ablation (burning off) of the outer material and an
implosion of the contents. The implosion is caused by a compressional wave driving radially
into the material from the periphery, thereby squeezing the pellet into a very dense core.
Very high temperatures, in excess of $10^{8}$ K, are produced within this core and at these
temperatures the velocities of the deuterium and tritium atoms are so great that the electrostatic
repulsion of the positive nuclei is overcome and the atoms undergo fusion. A typical fusion
reaction is



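$${}^{2}_{1}\mathrm{H} + {}^{3}_{1}\mathrm{H} \rightarrow {}^{4}_{2}\mathrm{He} + {}^{1}_{0}\mathrm{n}\ (14\ \mathrm{MeV}) \qquad (6.15)$$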



The reaction yields a helium atom and an energetic neutron. For many such reactions to take
place within the compressed pellet, the high temperature must be maintained for about 1 ns
and the compression must be in the region of $10^{4}$:1. These conditions require enormous laser
pulse energies.

While very large numbers of neutrons have been observed following a laser pulse, these
are still considerably lower than those required to reach 'scientific breakeven', which is defined
as the level at which the thermonuclear energy generated equals the laser energy input.
Calculations indicate that a laser input pulse of perhaps $10^{14}$ W with a subnanosecond
duration may be necessary to achieve breakeven and also that laser-induced fusion may be most
efficient in the wavelength range 300-600 nm. A series of lasers with steadily increasing
capability has been built over the last few years for this research. These include the Shiva
Nd:glass laser at the Lawrence Livermore Laboratory, California, which has 20 amplifier
chains each delivering about 1 TW in a 100 ps pulse to the fuel pellets. The individual amplifier
chains may have quite contorted geometries to equalize their lengths to ensure that they
all deliver their energy to the target at precisely the same instant. The energy supplied to
the amplifier chains is derived from a single Nd:glass master laser. A similar system called
Nova, which is 20 times more powerful, was completed in 1985. Nova comprises eight separate
Nd:glass amplifier beam lines, all fed by a single oscillator. Each beam line has an input
energy per pulse of approximately 100 nJ, and a total gain of $10^{11}$ gives output pulses of some
10 kJ, thereby producing a combined energy of 80 kJ with a pulse duration of 1-2 ns, so
that the peak power approaches $10^{14}$ W. The 1.06 µm emission wavelength can be frequency
doubled to 525 nm to increase the efficiency of the fusion process.

An even more powerful laser facility, the National Ignition Facility, has just been approved
by the US Congress (ref. 6.25) with a target completion date of 2002. The facility, which
again is based on Nd:glass amplifiers, will have 192 separate beam lines in which the emission
from a single laser will be amplified and undergo two levels of frequency conversion
(from a wavelength of 1.06 µm to 525 nm to 350 nm) prior to hitting the target. Each beam
line will produce output energies of 15 kJ in pulses of 3 ns duration, so the total design energy
is about 3 MJ and peak powers will approach $10^{15}$ W. Although neodymium-based lasers
and amplifiers are attractive for fusion studies in view of the enormous powers they can generate,
it is debatable as to whether their use in practical power generation will be feasible.
The main reason for this is that they cannot be cooled sufficiently rapidly to allow the repetition
rate of 100 pulses per second which may be required. Gas lasers, however, can be
cooled quickly by convection and the use of CO₂ lasers has been investigated. As suggested
above, however, the CO₂ emission wavelength may be less favourable than shorter wavelengths,
and for that reason there has been some interest in the use of excimer lasers, which
can emit in the 200-600 nm range, to initiate fusion reactions. Developments within
the next few years may yet realize the goal of fusion-based power generation (ref. 6.26).
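
The energy and power figures quoted for Nova and the National Ignition Facility can be cross-checked with a few lines of arithmetic. The sketch below is a consistency check only; the function names are illustrative, the Nova gain of 10^11 is the value implied by the quoted 100 nJ input and 10 kJ output, and the peak power is estimated by assuming a rectangular pulse (energy divided by duration).

```python
def total_energy(n_beams: int, energy_per_beam_j: float) -> float:
    """Combined pulse energy (J) of n identical beam lines."""
    return n_beams * energy_per_beam_j


def peak_power(energy_j: float, pulse_s: float) -> float:
    """Rough peak power (W), assuming a rectangular pulse shape."""
    return energy_j / pulse_s


# Nova: 8 beam lines, 100 nJ input per beam, gain implied by the 10 kJ output.
nova_beam_j = 100e-9 * 1e11               # about 1e4 J per beam
nova_total_j = total_energy(8, nova_beam_j)
nova_peak_w = peak_power(nova_total_j, 1e-9)   # shortest quoted pulse, 1 ns

# National Ignition Facility design figures: 192 beams, 15 kJ each, 3 ns.
nif_total_j = total_energy(192, 15e3)
nif_peak_w = peak_power(nif_total_j, 3e-9)

print(f"Nova: {nova_total_j / 1e3:.0f} kJ total, peak power ~ {nova_peak_w:.1e} W")
print(f"NIF:  {nif_total_j / 1e6:.2f} MJ total, peak power ~ {nif_peak_w:.1e} W")
```

Under these assumptions the totals come out at 80 kJ and just under 3 MJ, with peak powers a little below 10^14 W and 10^15 W respectively, in line with the figures in the text.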



PROBLEMS 



6.1 If the halfwidth of the He-Ne 632.8 nm transition is 1500 MHz, what must be the length 
of the laser cavity to ensure that only one longitudinal mode oscillates? Estimate the 
accuracy to which the temperature must be controlled if the frequency stability is to be 
better than 10 8 Hz. Take the coefficient of expansion of the laser tube to be lO'^C' 1 . 

6.2 If a cavity mode burns holes in the gain versus velocity curve for the 632.8 nm He-Ne 
transition at the spectral halfwidth points (halfwidth = 1500 MHz), what is the veloc- 
ity of the atoms involved in the hole burning? 
6.3 If one of the mirrors in the Michelson interferometer moves with velocity $v$, show that
the rate at which the fringes cross the field of view is $2v/\lambda$. Show that this result can
be obtained by calculating the frequency of the beats generated between the light
reflected from the stationary mirror and the moving mirror. (The Doppler shift of the
frequency of light reflected from moving objects is the basis of many laser velocimeters;
see ref. 6.4e, pp. 306-15 and 334.)

6.4 Calculate the Doppler frequency shift for light (λ = 500 nm) reflected from an object
moving at 20 m s⁻¹; what implications does your answer have for the frequency stabilization
of lasers used in Doppler velocimeters?

6.5 Explain how a laser may be used to measure the width of a narrow slit from the
Fraunhofer diffraction pattern produced by the slit. The second minimum in the diffraction
pattern, which is formed in the focal plane of a lens of focal length 5 m, is
2 mm from the central maximum. What is the width of the slit (take λ = 632.8 nm)?
Discuss how this technique may be extended to the measurement of the diameter of a
thin wire and hence used in controlling the diameter of the wire during production.

6.6 A CW argon laser emits 1 W at λ = 488 nm; if the beam divergence is 0.5 mrad and
the diameter of the beam at the output mirror is 2 mm, calculate the brightness of the
laser. To what photometric brightness (or luminance) does this correspond?

6.7 Compare the mode-locked pulse width, and the separation between pulses produced by

(a) an He-Ne laser operating at 632.8 nm, with a mirror spacing of 0.5 m, given that
the emission line width is $1.5 \times 10^{9}$ Hz;

(b) an Rh 6G dye laser operating over its full gain bandwidth of 570-640 nm with a
mirror spacing of 2.0 m;

(c) an Nd:glass laser in which mirrors are formed on the ends of a laser rod 2 m long
and assuming that there are 3000 participating longitudinal modes; take the refractive
index of Nd:glass to be 1.54.

6.8 If the halfwidth of the 10.6 µm transition of a low pressure CO₂ laser is 60 MHz,
calculate the coherence length of the laser. If the cavity length is 2 m, show that not more
than one mode will oscillate. If we take the width of the Fabry-Perot resonances as an
(over)estimate of the spectral width of the mode, calculate the coherence length; take
the mirror reflectance $R$ to be 0.95. (See ref. 1.1c, pp. 90-6, for example.)

6.9 What is the total energy release in the nuclear reaction given in eq. (6.15)? Why does
this value differ from that given in the text for the energy of the neutron? (You will need
a table of nuclear masses.)
Photodetectors



The optical detectors discussed in this chapter may be classified as either thermal detectors
or photon detectors. In thermal detectors, the absorption of light raises the temperature of
the device and this in turn results in changes in some temperature-dependent parameter (e.g.
electrical conductivity). As a consequence, the output of thermal detectors is usually
proportional to the amount of energy absorbed per unit time by the detector and, provided the
absorption efficiency is the same at all wavelengths, is independent of wavelength. In photon
detectors, on the other hand, the absorption process results directly in some specific quantum
event (such as the photoelectric emission of electrons from a surface) which is then counted
by the detection system. Thus the output of photon detectors is governed by the rate of
absorption of light quanta and not directly by their energy. Furthermore, all the photon
processes considered here require a certain minimum photon energy to initiate them. Since
the energy of a single photon is given by $E = h\nu = hc/\lambda$, photon detectors have
a long wavelength 'cut-off', that is a maximum wavelength beyond which they do not operate.
A problem encountered with photon detectors at long wavelengths is that the photon
energies involved become comparable with the average thermal energies of the atoms in
the detector itself. A relatively large number of quantum events may then be triggered by
thermal excitation rather than by light absorption and will constitute a source of noise.
The obvious way to reduce this noise signal is to reduce the temperature of the detector;
indeed most photon detectors operating at long infrared wavelengths must be cooled
to liquid nitrogen temperatures (77 K) or below.



7.1 Detector performance parameters

Before we embark on a detailed discussion of individual detectors, it is useful to define some
of the parameters that are commonly used to assess their performance. The responsivity
$R$ is defined as the ratio of the detector output to its input. The units of responsivity depend
both on the type of detector and its intended use; typically they are amperes (or volts) per
watt. If the device is intended for use in the visible then photometric units may be used
for the input. The responsivity will vary with wavelength, and the spectral
response is usually given as a curve of $R$ versus wavelength.

It is obviously important that a detector is able to follow the variations in the
incoming radiation as closely as possible. For example, a sudden step change in the input
should produce a similar step change in the output of the detector. Inevitably there are limits
to all detector performances in this respect. Many detectors respond to such a step change
with an exponential rise or fall, characterized by a response time or time constant $\tau$. As is
shown in Appendix 5, this implies that the detector will have a responsivity which depends
on the light modulation frequency $f$ according to

$$R(f) = \frac{R(0)}{(1 + 4\pi^2 f^2 \tau^2)^{1/2}} \qquad (7.1)$$

The upper operating frequency (the cut-off frequency, $f_c$) of a detector can be defined as the
frequency at which the electrical power output falls to one-half of its maximum value. Since
the responsivity of an optical detector inevitably involves either a current or voltage then
we are looking for the frequency at which the responsivity falls to $1/\sqrt{2}$ of its maximum value.
If the responsivity is given by eq. (7.1) where $R(f)$ has its maximum value at $f = 0$, the
cut-off frequency is given by $f_c = 1/(2\pi\tau)$. Obviously a detector can only give a faithful
representation of an input signal if the highest frequency content of the signal is less than $f_c$.
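
The relationship between the time constant and the cut-off frequency can be checked numerically. The following is a minimal sketch of the single-time-constant response described above; the function names and the 1 µs time constant are illustrative assumptions rather than values from the text.

```python
import math


def responsivity(f_hz: float, r0: float, tau_s: float) -> float:
    """Responsivity at modulation frequency f for a single time constant tau."""
    return r0 / math.sqrt(1.0 + (2.0 * math.pi * f_hz * tau_s) ** 2)


def cutoff_frequency(tau_s: float) -> float:
    """Frequency at which the responsivity falls to 1/sqrt(2) of R(0)."""
    return 1.0 / (2.0 * math.pi * tau_s)


tau = 1.0e-6                      # assumed time constant of 1 us
f_c = cutoff_frequency(tau)
ratio_at_cutoff = responsivity(f_c, 1.0, tau)

print(f"cut-off frequency: {f_c / 1e3:.1f} kHz")
print(f"R(f_c)/R(0) = {ratio_at_cutoff:.4f}")
```

At the cut-off frequency the ratio evaluates to about 0.707, so the electrical power output (proportional to the square of current or voltage) is halved, as the definition requires.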

Another important aspect of a detector is its sensitivity, that is its ability to measure very
small optical signals. The limitations on the size of signal that can be detected come about
because of the presence of noise; that is, an output from the detector that is completely
uncorrelated with the signal. The ratio of the magnitude of the output due to the signal to that of
the noise is referred to as the signal-to-noise ratio (S/N). It is generally assumed that it will
be difficult to measure a signal when it generates an r.m.s. output equal to that generated by
the noise (i.e. when S/N = 1). An indication of the size of the minimum detectable signal
(or sensitivity) is given by the noise equivalent power (NEP). This is defined as the power
of sinusoidally modulated monochromatic radiation which would result in the same r.m.s.
output signal in an 'ideal' noise-free detector. Statements of the NEP should be given along
with a statement of the modulation frequency, detector bandwidth, detector temperature and
detector area. It is useful to remember that many noise sources produce what is known as
white noise, where the noise power within a bandwidth $\Delta f$ is proportional to $\Delta f$. Because both
current and voltage are proportional to the square root of electrical power, noise current and
noise voltage are then proportional to $\Delta f^{1/2}$.

If we assume that the noise power generated in a detector is proportional to its sensitive
area $A$, then the noise current (or voltage) will vary as $A^{1/2}$. Thus we may define a unit NEP*
which takes into account the effects of variable bandwidth and detector area where

$$\mathrm{NEP}^* = \frac{\mathrm{NEP}}{(A\,\Delta f)^{1/2}}$$

In fact it is the reciprocal of this quantity, known as the specific detectivity ($D^*$), that is
commonly used. Thus

$$D^* = \frac{(A\,\Delta f)^{1/2}}{\mathrm{NEP}} \qquad (7.2)$$



The value of $D^*$ for a particular detector will depend on the wavelength of the signal
radiation and the frequency at which it is modulated, and is often quoted in the format $D^*(\lambda, f)$.
Curves showing the variation of $D^*$ with $\lambda$ for several of the detectors that will be
discussed in this chapter are shown in Fig. 7.1. Also shown is the theoretical $D^*$ curve
for a detector where the noise arises from carrier generation
caused by the presence of blackbody radiation at 300 K. It can be seen that, at their best, a
number of detectors approach this theoretical limit.
Provided that other criteria such as frequency response are met, and that the noise is not
background radiation limited, then at a particular wavelength the detector with the highest
$D^*$ value is generally the best choice.



EXAMPLE 7.1 Detector sensitivity 



From Fig. 7.1 a PbS detector has a $D^*$ of $10^{9}$ m Hz$^{1/2}$ W$^{-1}$ at a wavelength of 2 µm.
If we assume a detector area of $25 \times 10^{-6}$ m$^2$ and a detection bandwidth of 10 kHz then
the sensitivity of the device at this wavelength may be calculated from eq. (7.2). Thus

$$\mathrm{NEP} = \frac{(25 \times 10^{-6} \times 10^{4})^{1/2}}{10^{9}}\ \mathrm{W} = 5 \times 10^{-10}\ \mathrm{W}$$
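
The arithmetic of this example follows directly from eq. (7.2) and can be reproduced as below. This is a small sketch only: the function name is illustrative, and the second call, with a 100 Hz bandwidth, is an added assumption used to show how the NEP scales with the square root of the bandwidth.

```python
import math


def nep_from_dstar(d_star: float, area_m2: float, bandwidth_hz: float) -> float:
    """Noise equivalent power (W) from D* (m Hz^1/2 W^-1), area and bandwidth."""
    return math.sqrt(area_m2 * bandwidth_hz) / d_star


# Values of Example 7.1: D* = 1e9, A = 25e-6 m^2, bandwidth = 10 kHz.
nep_example = nep_from_dstar(1e9, 25e-6, 1e4)

# Assumed narrower bandwidth of 100 Hz for comparison.
nep_narrow = nep_from_dstar(1e9, 25e-6, 1e2)

print(f"NEP (10 kHz bandwidth)  = {nep_example:.1e} W")
print(f"NEP (100 Hz bandwidth)  = {nep_narrow:.1e} W")
```

Reducing the bandwidth by a factor of 100 lowers the NEP by a factor of 10, which is why the bandwidth must always be stated alongside an NEP figure.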



7.2 Thermal detectors



To gain an insight into the performance characteristics of thermal detectors, we consider the
behaviour of the simple model shown in Fig. 7.2. The incoming radiation is absorbed within
the sensing element of heat capacity $H$. This is connected to a heat sink at constant temperature
$T_s$ via a heat conducting link which has a thermal conductance $G$. If the instantaneous
rate of heat absorption is given by $W$, then during a small time interval $\delta t$ the heat
absorbed is $W\,\delta t$. If we let the temperature of the element be $T_s + \Delta T$ then during the same
time interval the amount of heat lost through the thermal link is $G\,\Delta T\,\delta t$. The difference
between these two represents the amount of heat available to raise the temperature of the
element. Hence we may write

$$W\,\delta t - G\,\Delta T\,\delta t = H\,\delta(\Delta T)$$

If we take the limit $\delta t \to 0$ we obtain

$$W - G\,\Delta T = H\,\frac{d(\Delta T)}{dt} \qquad (7.3)$$

FIG. 7.2 Model of a thermal detector used to derive the frequency response characteristics. The incoming
radiation causes the instantaneous temperature of the sensing element to be $T_s + \Delta T$. The element is
connected via a conducting link (of conductance $G$) to a heat sink which remains at the temperature $T_s$.

Now suppose that $W$ has a time dependence given by $W = W_0 + W_f\cos(2\pi f t)$ where $W_0 > W_f$
and also that $\Delta T$ can similarly be written $\Delta T = \Delta T_0 + \Delta T_f\cos(2\pi f t + \phi)$. By substituting these
relations into eq. (7.3), it may be verified by the reader (see Problem 7.3) that $\Delta T_f$ is given
by

$$\Delta T_f = \frac{W_f}{(G^2 + 4\pi^2 f^2 H^2)^{1/2}} \qquad (7.4)$$

For good sensitivity, it is obviously desirable to have as large a value of $\Delta T_f$ as possible;
inspection of eq. (7.4) shows that this implies small values for both $H$ and $G$. This may be
achieved by using thin absorbing elements of small area (to reduce $H$) which have minimal
support (to reduce $G$). We may thus expect high sensitivity devices to be rather fragile.
Looking now at the frequency characteristics, we may rewrite eq. (7.4) as

$$\Delta T_f = \frac{W_f}{G\,(1 + 4\pi^2 f^2 \tau_H^2)^{1/2}}$$

where $\tau_H$, the thermal time constant, is given by

$$\tau_H = H/G \qquad (7.5)$$

For good response at a frequency $f$ we require $\tau_H < 1/(2\pi f)$.
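
The trade-off between sensitivity and speed expressed by eqs (7.4) and (7.5) can be explored numerically. The sketch below is illustrative: the function names and the values chosen for the heat capacity, thermal conductance and modulated power are assumptions, not figures from the text.

```python
import math


def temp_amplitude(w_f: float, g: float, h: float, f_hz: float) -> float:
    """Amplitude of the temperature oscillation, eq. (7.4)."""
    return w_f / math.sqrt(g ** 2 + (2.0 * math.pi * f_hz * h) ** 2)


def thermal_time_constant(h: float, g: float) -> float:
    """Thermal time constant tau_H = H/G, eq. (7.5)."""
    return h / g


# Assumed, illustrative element: H = 1e-6 J/K, G = 1e-4 W/K, W_f = 1 uW.
H_ASSUMED, G_ASSUMED, W_F = 1.0e-6, 1.0e-4, 1.0e-6

tau_h = thermal_time_constant(H_ASSUMED, G_ASSUMED)
f_corner = 1.0 / (2.0 * math.pi * tau_h)

dt_low = temp_amplitude(W_F, G_ASSUMED, H_ASSUMED, 0.0)
dt_corner = temp_amplitude(W_F, G_ASSUMED, H_ASSUMED, f_corner)

print(f"tau_H = {tau_h:.3f} s, corner frequency = {f_corner:.1f} Hz")
print(f"dT at low frequency = {dt_low:.3e} K, at corner = {dt_corner:.3e} K")
```

Halving G in this sketch doubles the low-frequency response W_f/G but also doubles the time constant, which is the compromise the text describes.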

Thus once H has been fixed (from size considerations) then G cannot be made too small 
otherwise the response time may become unduly long. Typical values forx H found in prac- 
tice usually range from 10" s upwards, although smaller values can be achieved The lim- 
iting sensitivity of thermal detectors is governed by temperature fluctuations within the 
detector, which arise from random fluctuations in the energy flow rate out of the element.
It may be shown (ref. 7.1) that the root mean square (r.m.s.) fluctuations in the power ($\Delta W_f$)
flowing through a thermal link, which have frequencies between $f$ and $f + \Delta f$, can be written

$$\Delta W_f = (4kT^2 G\,\Delta f)^{1/2} \qquad (7.6)$$

The smallest value of $G$ obtainable is when energy exchange takes place by means of radiative
exchange only. In this case, the minimum detectable power at room temperature for a
10 mm$^2$ area detector is about $5 \times 10^{-11}$ W (see Problem 7.4). The best detectors available
approach to within an order of magnitude of this figure at room temperature, and the
performance of some can be improved further by cooling.

The receiving element is often in the form of a thin metal strip with a suitably absorbent
surface coating such as 'gold black' (an evaporated film of gold which is uniformly absorbent
at wavelengths from the UV well into the IR). Mounting the element in a vacuum enclosure
gives increased stability due to isolation from air movement, although window transmission
can then be a problem if a wide wavelength range is desired. To overcome drift in the output
caused by changes in the ambient temperature, we can take the difference in output of two
identical detectors in close proximity, only one of which is exposed to the incident radiation.

When very large amounts of radiation are encountered (such as in the outputs from high 
power lasers), more massive detector elements are used; these are often in the form of stain- 
less steel disks or cones. 

Because of their relative unimportance in the field of optoelectronics we deal only briefly 
with a few of the better known types of thermal detector; for a more detailed discussion, the 
reader is referred to ref. 7.1. 

7.2.1 Thermoelectric detectors 

Thermoelectric detectors use the principle of the thermocouple (i.e. the Seebeck effect) 
whereby the heating of one junction between two dissimilar metals relative to the other causes
a current to flow round the circuit which is proportional to the temperature difference between 
the junctions. In thermoelectric detectors one junction is used to sense the temperature rise 
of the receiving element whilst the other is maintained at ambient temperature, as shown in 
Fig. 7.3. A rather more sensitive detector may be made by connecting several thermocou- 
ples together in series; the device is then known as a thermopile. Efficient operation calls 
for materials with large electrical conductivities (to minimize Joule heating effects) and also 
small thermal conductivities (to minimize heat conduction losses). These two requirements 
are usually incompatible, and a compromise has to be reached. Although metals are most 
often used for the junction materials, certain heavily doped semiconductors can offer improved 
sensitivity, but these are generally less robust and give rise to constructional problems. The 
usefulness of thermoelectric detectors lies in their simplicity and their rugged construction. 

7.2.2 The bolometer 

FIG. 7.3 Thermoelectric detector: the temperature change of the sensing element induced by the
absorption of radiation is detected using a thermocouple, one junction of which is attached to the sensing
element, and the other shielded from the radiation.

In the bolometer the incident radiation heats a fine wire or metallic strip causing a change
in its electrical resistance. This may be detected in several ways: for example the element
may be inserted into one arm of a Wheatstone bridge (Fig. 7.4), or in place of the
photoconductor in the circuit of Fig. 7.17. Care must be taken to ensure that any currents flowing
through the element are sufficiently small not to raise its temperature by a significant amount.
The main parameter of interest in assessing the performance is the temperature coefficient
of resistance $\alpha$, which is given by

$$\alpha = \frac{1}{\rho}\frac{d\rho}{dT}$$

where $\rho$ is the resistivity of the material and $T$ the temperature. The resistivity of metals
increases with increasing temperature, and hence for these $\alpha$ will be a positive quantity.
Platinum and nickel are the most commonly used, and both have $\alpha$ values of about 0.005 K$^{-1}$.
Greater sensitivity may be achieved by using semiconducting elements, which are sometimes
called thermistors. These consist of oxides of manganese, cobalt or nickel and have
$\alpha$ values of about -0.06 K$^{-1}$ (for these materials, $\alpha$ is dependent on temperature). The
negative sign arises because of the characteristic decrease in resistivity with increasing
temperature of semiconductors above a certain temperature (see eq. 2.24 and the discussion
following eq. 2.36).
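
For a small temperature rise the definition of the temperature coefficient gives a fractional resistance change of approximately α ΔT, provided the element's dimensions do not change appreciably. The sketch below compares a metal element with a thermistor on this basis; the function name and the 1 mK temperature rise are illustrative assumptions, and the coefficients are read as about 0.005 K^-1 for the metals and -0.06 K^-1 for the thermistor.

```python
def fractional_resistance_change(alpha_per_k: float, delta_t_k: float) -> float:
    """First-order estimate of dR/R = alpha * dT, valid for small dT."""
    return alpha_per_k * delta_t_k


ALPHA_METAL = 0.005        # K^-1, platinum or nickel (approximate)
ALPHA_THERMISTOR = -0.06   # K^-1, oxide thermistor (approximate)
DELTA_T = 1.0e-3           # assumed temperature rise of 1 mK

metal_change = fractional_resistance_change(ALPHA_METAL, DELTA_T)
thermistor_change = fractional_resistance_change(ALPHA_THERMISTOR, DELTA_T)

print(f"metal:      dR/R = {metal_change:+.1e}")
print(f"thermistor: dR/R = {thermistor_change:+.1e}")
print(f"sensitivity ratio = {abs(thermistor_change / metal_change):.0f}")
```

On these figures the thermistor gives roughly an order of magnitude larger resistance change than the metal element, with the opposite sign.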

Carbon resistance bolometers cooled to liquid helium temperature (4.2 K) have proved
successful in far-IR astronomy where very sensitive detectors are required. Use can also be
made of superconducting materials; in these the resistivity drops suddenly to zero below a
particular temperature (the 'critical temperature'). In operation the temperature of the element
is held just below this value so that the absorption of even a small amount of radiation will
cause a very rapid increase in the resistivity. The devices exhibit exceptional sensitivity but
require that the ambient temperature be very accurately controlled. The latest designs use
high temperature superconducting materials such as yttrium barium copper oxide (YBCO).

FIG. 7.4 Wheatstone bridge circuit incorporating a bolometer radiation sensing element. When the
resistance values are such that $R_1/R_2 = R_3/R_4$ then the current $i_G$ through the galvanometer is zero. If,
however, the sensing element resistance changes slightly then a current will flow which is proportional to
the resistance change.

7.2.3 Pneumatic detectors 

The receiving element in a pneumatic detector is placed inside an airtight chamber. Radiation 
falling on the element causes the air temperature inside the chamber to rise and hence the 
air pressure to increase. This pressure increase may be detected in several ways. In one of 
the most sensitive detectors, the Golay cell, a wall of the chamber has a hole in it covered 
by a flexible membrane silvered on its outside surface. This acts as a mirror whose focal 
length depends on the pressure within the chamber (see Fig. 7.5). A beam of light originat- 
ing from a source S passes through a grating, is then reflected from the flexible mirror to 
repass through the grating, and finally is directed onto a detector D. When no radiation is 
being absorbed within the chamber, the optics are arranged so that an image of the trans- 
mitting region of the grating is superimposed on a non-transmitting region and there is then 
no output from D. However, if the mirror changes its curvature slightly, light will be trans- 
mitted through the grating and recorded by D. The output from D is then proportional to the 
amount of radiation absorbed within the chamber. Golay cells are available which can detect 
radiation powers down to $10^{-11}$ W; they are, however, rather fragile and difficult to set up.

7.2.4 Pyroelectric detectors 

Pyroelectric detectors are a more recent development, and while they do not have the same
sensitivity as the Golay cell they can be made with very rapid response times and are very
robust. The incident radiation is absorbed in a ferroelectric material which has molecules
with a permanent electrical dipole moment. Below a critical temperature, the Curie
temperature $T_c$, the dipoles are partially aligned along a particular crystallographic axis giving
rise to a net electrical polarization of the crystal as a whole. When the material is heated,
the increased thermal agitation of the dipoles decreases the net polarization, which eventually
becomes zero above $T_c$, as shown in Fig. 7.6.

FIG. 7.5 Schematic diagram of a Golay cell detector. A beam of light originating from a source S passes
through a grating. It is then reflected from a flexible mirror which forms part of the wall of a pneumatic
chamber. The beam subsequently repasses through the grating and is directed onto a light detector D.
Radiation absorbed within the chamber causes pressure fluctuations which in turn cause the curvature of
the flexible mirror to change.

The most sensitive material in use is triglycine sulfate (TGS), but this has an inconveniently
low Curie temperature of only 49 °C, and more commonly used materials are ceramic
based, such as lead zirconate, which have Curie temperatures of several hundred degrees
centigrade.

The detector consists of a thin slab of ferroelectric material cut such that the spontaneous
polarization direction is normal to the large area faces. Transparent electrodes are evaporated
onto these faces and connected together with a load resistor (which can have values
as high as $10^{11}$ Ω) as shown in Fig. 7.7(a). A temperature change of the ferroelectric causes
the spontaneous polarization to vary and hence also the amount of captive surface charge
on the faces. Changes in the surface charge induce corresponding changes in the charge on
the electrodes, thus causing a current to flow through the load resistor. This in turn results
in a changing voltage signal appearing across the load resistor. Radiation of constant irradiance
will not cause any change in the charge stored on the electrodes and will consequently
not give rise to an output signal. The frequency response of the pyroelectric detector is
considered in detail in Problem 7.6. At low frequencies, the output rises from zero to reach a
plateau when $f > 1/(2\pi\tau_H)$, where $\tau_H$ is the thermal time constant given by eq. (7.5).

At higher frequencies, the electrode capacitance $C$ acts as a signal shunt across the load
resistor $R_L$ and the output voltage falls to $1/\sqrt{2}$ of its maximum value at a cut-off
frequency given by

$$f_c = \frac{1}{2\pi R_L C}$$
FIG. 7.6 Spontaneous electrical polarization versus temperature for a ferroelectric material (schematic
diagram). The polarization falls to zero at the Curie temperature $T_c$.
FIG. 7.7 (a) Pyroelectric detector. A slab of ferroelectric material is sandwiched between two electrodes
(one being transparent). The electrodes are connected by a load resistor $R_L$. Radiation absorbed within the
ferroelectric material causes it to change its polarization. The induced charge on the electrodes changes
and current flows through $R_L$ causing a voltage signal to appear across $R_L$. (b) Equivalent circuit and typical
impedance matching circuitry for a pyroelectric detector. The varying amounts of charge stored on the
electrodes are equivalent to a current generator feeding into the electrode capacitance $C$. The load resistor
$R_L$ is in parallel with $C$. Since $R_L$ is usually very high (about $10^{11}$ Ω or more), an impedance matching circuit
is often employed to reduce the signal source impedance. A typical circuit using a JFET is shown here; the
output impedance in this case is then $R$ (≈ 1 kΩ).



As the voltage output is proportional to $R_L$ (again see Problem 7.6) there is a trade-off between
sensitivity and frequency response. Typically, a detector with a frequency bandwidth of 1 Hz
at an operating frequency of 100 Hz can detect radiation powers of about $10^{-8}$ W.
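
The trade-off between sensitivity and bandwidth can be illustrated with a short numerical sketch.
It assumes the usual RC low-pass result for the electrical cut-off frequency, $f_c = 1/(2\pi R_L C)$.
The function name and the 10 pF electrode capacitance are illustrative assumptions, not values
taken from the text.

```python
import math

def pyro_cutoff_frequency(load_resistance_ohm, electrode_capacitance_f):
    """Cut-off frequency f_c = 1 / (2*pi*R_L*C) of the load resistor shunted by C."""
    return 1.0 / (2.0 * math.pi * load_resistance_ohm * electrode_capacitance_f)

# Assumed electrode capacitance (hypothetical, for illustration only).
C_ELECTRODE = 10e-12  # farads

# The signal voltage scales with R_L, but the cut-off frequency scales as 1/R_L.
for r_load in (1e11, 1e9, 1e7):
    f_c = pyro_cutoff_frequency(r_load, C_ELECTRODE)
    print(f"R_L = {r_load:.0e} ohm -> f_c = {f_c:.3g} Hz")
```

Reducing $R_L$ by a factor of 100 raises the cut-off frequency by the same factor, at the cost of a
correspondingly smaller output voltage.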

Because of the comparatively large values of the load resistor encountered in pyroelectric
detectors, an impedance matching circuit is usually built into the detector. A source follower
circuit using a JFET is commonly used as shown in Fig. 7.7(b).

Pyroelectric detectors can be made with response times in the nanosecond region and with
a wavelength response extending out to 100 µm. They have proved very useful as low cost,
robust IR detectors in such uses as fire detection and intruder alarms.
7.3 Photon devices

7.3.1 Photoemissive devices

When radiation with a wavelength less than a critical value is incident upon a metal surface,
electrons are found to be emitted; this is called the photoemissive or photoelectric effect (see
section 2.6). When a photon of energy $h\nu$ enters the metal it may be absorbed and give up
its energy to an electron. Provided the electron is then able to reach the surface and has enough
energy to overcome the surface potential barrier (given by $e\phi$, where $\phi$ is the surface work
function) it may escape and photoelectric emission takes place as illustrated in Fig. 7.8.
If the electron is initially at the Fermi level, its kinetic energy $E$ on emission is given by

$$E = h\nu - e\phi \tag{7.8}$$

Since, however, the electron may be below the Fermi level initially and may also suffer inelastic
collisions before emission, eq. (7.8) in fact represents the maximum energy available to
the emitted electrons. No electrons at all will be emitted when $h\nu < e\phi$ (or in terms of
wavelength, $\lambda_0 > hc/e\phi$). If the probability of inelastic collisions of the excited electrons is high
then only a fraction of them may be able to escape. The ratio of the number of emitted
electrons to the number of absorbed photons is called the quantum yield or quantum efficiency.

Pure metals, however, are rarely used as practical photocathodes since they have low
quantum efficiencies (≈ 0.1%) and high work functions. (Caesium has the lowest value for
$e\phi$, namely 2.1 eV.) We may divide practical photoemissive surfaces into two groups: (a)
the older classical types and (b) the newer negative electron affinity (NEA) types. The
former consist of a thin evaporated layer containing compounds of alkali metals (usually
including Cs) and one or more metallic elements from group V of the periodic table (e.g.
Sb). They are often designated by an 'S' number. To some extent we may regard them as
semiconductors, and hence most electrons must gain an energy of at least $E_g + \chi$ (where $E_g$
is the energy gap and $\chi$ is the electron affinity) to escape from the surface (see Fig. 2.17b).
For example, in the material NaKCsSb ('S20') $E_g = 1$ eV and $\chi = 0.4$ eV, and hence it
should have a threshold at a photon energy of about 1.4 eV, which is indeed close to that
observed.

FIG. 7.8 Electron energy level diagram at a metal/vacuum interface illustrating the photoelectric effect.
To escape from inside a metal, an electron must gain at least an energy $e\phi$, where $\phi$ is the work function
expressed in electron volts.
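
As a rough numerical check on these thresholds, the sketch below converts an escape energy into a
threshold wavelength and evaluates the maximum kinetic energy of eq. (7.8). The function names are
our own, and the escape energies (2.1 eV for caesium, 1.4 eV for the S20 surface) are the values
quoted above.

```python
H = 6.626e-34         # Planck constant, J s
C = 2.998e8           # speed of light in vacuum, m/s
E_CHARGE = 1.602e-19  # electronic charge, C (also joules per eV)

def threshold_wavelength_nm(escape_energy_ev):
    """Longest wavelength (nm) that can eject an electron: lambda = hc / (e*phi)."""
    return H * C / (escape_energy_ev * E_CHARGE) * 1e9

def max_kinetic_energy_ev(wavelength_nm, escape_energy_ev):
    """Maximum kinetic energy from eq. (7.8), E = h*nu - e*phi, clipped at zero."""
    photon_energy_ev = H * C / (wavelength_nm * 1e-9) / E_CHARGE
    return max(photon_energy_ev - escape_energy_ev, 0.0)

print(f"Cs  (2.1 eV): threshold = {threshold_wavelength_nm(2.1):.0f} nm")
print(f"S20 (1.4 eV): threshold = {threshold_wavelength_nm(1.4):.0f} nm")
print(f"Cs at 400 nm: E_max = {max_kinetic_energy_ev(400.0, 2.1):.2f} eV")
```

The S20 threshold lies in the near infrared, whereas pure caesium responds only to wavelengths
shorter than about 0.6 µm.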

It is possible, however, to reduce the effective value of $\chi$, as far as the bulk electrons are
concerned, provided that band bending takes place at the surface. This is found to occur when
there are states within the energy gap at the semiconductor surface. If we suppose that a large
number of these are close to the valence band, they will fill with holes and lead to a local
depletion in the hole population. The resulting uncovering of negatively charged acceptor
ions in the vicinity will lead to the formation of a depletion region at the surface very similar
to that formed within the p material at a p-n junction. The potential drop across the depletion
region leads to band bending as shown in Fig. 7.9. If we write the potential drop as $V_s$
then the effective electron affinity $\chi_{\mathrm{eff}}$, as far as the electrons in the bulk are concerned, is
given by

$$\chi_{\mathrm{eff}} = \chi - V_s$$

If $V_s > \chi$ then we have a negative electron affinity and the effective work function for bulk
electrons is just $E_g$.

In practice, NEA photocathodes are formed by evaporating very thin layers of caesium
or caesium oxide onto the semiconductor surface. The resulting band structure is more
complex than shown in Fig. 7.9 but the essential features remain. Photocathodes using
GaAs can be made which operate with quite high quantum efficiencies up to a wavelength
corresponding to the energy gap of GaAs (≈ 0.9 µm). So far, it has not proved possible to
make NEA photocathodes which operate much above 1.1 µm with any reasonable efficiency.

FIG. 7.9 The effective electron affinity of a semiconductor may be altered if band bending takes place at
the surface. Here holes trapped in surface states cause a surface depletion region to be formed. The potential
drop $V_s$ across the depletion region reduces the effective electron affinity for bulk electrons from $\chi$ to
$\chi - V_s$.
7.3.2 Vacuum photodiodes

In the vacuum photodiode, a photoemissive surface (the photocathode) is placed inside a
vacuum tube with another electrode (the anode) placed nearby and biased positively with
respect to it, as shown in Fig. 7.11. When the photocathode is illuminated the emitted
electrons will be collected by the anode and a current will flow in the external circuit. If the bias
voltage is large enough (in practice a few hundred volts) all the emitted electrons are
collected and the current will be almost independent of bias voltage and proportional to the
light irradiance.

If monochromatic radiation with a (vacuum) wavelength of $\lambda_0$ and power $W$ is incident
on the photocathode then the number of photons arriving per second is $W\lambda_0/hc$,
and hence if the quantum efficiency of the photocathode is $\eta$ then the current flowing through
the external circuit, $i_\lambda$, is given by

$$i_\lambda = \frac{\eta e W \lambda_0}{hc} \tag{7.9}$$

FIG. 7.11 Schematic diagram of a photoelectric cell. Electrons released from the photocathode surface by
the incident radiation are attracted to the anode, thus causing a current to flow in the external circuit. A
voltage will then appear across the load resistor $R_L$.



Unless the incident power is relatively large the outputs from vacuum photodiodes are small
(see Example 7.2) and thus often require external amplification (this relative insensitivity
can be useful when examining high power laser pulses, however). A much more popular
device is the photomultiplier, which uses the same basic principles as the vacuum photodiode
but which amplifies the resulting currents internally.



EXAMPLE 7.2 Vacuum photodiode output

Suppose that 1 µW of radiation with a wavelength of 500 nm is incident on a vacuum
photodiode whose photocathode has a quantum efficiency of 0.5. The current generated
is then given by eq. (7.9) as

$$i_\lambda = \frac{0.5 \times 1.6 \times 10^{-19} \times 1 \times 10^{-6} \times 500 \times 10^{-9}}{6.6 \times 10^{-34} \times 3 \times 10^{8}} = 2 \times 10^{-7}\ \mathrm{A}$$
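
The calculation of Example 7.2 is easily repeated for other operating conditions. The following
sketch evaluates eq. (7.9) directly; the function name is our own, and slightly more precise values
of the physical constants are used than in the worked example, so the result differs in the third
significant figure.

```python
H = 6.626e-34         # Planck constant, J s
C = 2.998e8           # speed of light in vacuum, m/s
E_CHARGE = 1.602e-19  # electronic charge, C

def photodiode_current(power_w, wavelength_m, quantum_efficiency):
    """Photocurrent from eq. (7.9): i = eta * e * W * lambda / (h * c)."""
    return quantum_efficiency * E_CHARGE * power_w * wavelength_m / (H * C)

# Conditions of Example 7.2: 1 microwatt at 500 nm, quantum efficiency 0.5.
current = photodiode_current(1e-6, 500e-9, 0.5)
print(f"photocurrent = {current:.2e} A")
```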
7.3.3 Photomultipliers

In the photomultiplier, the photoelectrons are accelerated towards a series of electrodes (called
dynodes) which are maintained at successively higher potentials with respect to the cathode.
On striking a dynode surface, each electron causes the emission of several secondary
electrons which in turn are accelerated towards the next dynode and continue the multiplication
process. Thus, if on average $\delta$ secondary electrons are emitted at each dynode surface for
each incident electron and if there are $N$ dynodes overall, then the total current amplification
factor between the cathode and anode is given by

$$G = \delta^N \tag{7.10}$$

Considerable amplification is possible: if we take, for example, $\delta = 5$ and $N = 9$, we obtain
a gain of $2 \times 10^6$.
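
Because the gain depends on $\delta$ raised to the power $N$, small changes in either quantity have a
large effect. The sketch below evaluates eq. (7.10) and also inverts it to estimate how many dynodes
are needed for a given gain; the function names and the target gain of $10^6$ are illustrative
assumptions.

```python
import math

def pmt_gain(delta, n_dynodes):
    """Current gain from eq. (7.10): G = delta ** N."""
    return delta ** n_dynodes

def dynodes_for_gain(target_gain, delta):
    """Smallest number of dynodes N for which delta ** N reaches target_gain."""
    return math.ceil(math.log(target_gain) / math.log(delta))

print(pmt_gain(5, 9))            # the case considered in the text
print(dynodes_for_gain(1e6, 4))  # hypothetical target gain with delta = 4
```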

Four of the most common photomultiplier dynode configurations are illustrated in Fig.
7.12. Three of them (Venetian blind, box and grid, and linear focused) are used in 'end-on'
tubes. These have a semitransparent cathode evaporated onto the inside surface of one end
of the tube envelope. The photoelectrons are emitted from the opposite side of the cathode
layer to that of the incident radiation. Obviously, in this arrangement the thickness of the
photocathode is very critical. If it is too thick, few photons will penetrate to the
electron-emitting side, whilst if it is too thin few photons will be absorbed.

In the Venetian blind type, electrons strike a set of obliquely placed dynode slats at each
dynode stage; the electrons are attracted to the next set of slats by means of the interdynode
potential applied to a thin wire grid placed in front of the slats. This arrangement
is compact, relatively inexpensive to manufacture and is very suitable for large area cathodes.
The box and grid type (Fig. 7.12b) is somewhat similar in performance. In both of these,
very little attempt is made to focus the electrons, which is in contrast to the linear focused
and circular cage focused types (Figs 7.12c and d), where some degree of electron focusing
is obtained by careful shaping and positioning of the dynodes.

The focused types have somewhat higher electron collection efficiencies and a much better
response to high signal modulation frequencies (we discuss frequency response later in this
section). The circular cage focused type is very compact and usually used in conjunction
with a side window geometry. In this, the photocathode material is deposited on a metal
substrate within the glass envelope and the photoelectrons are emitted from the same side of
the cathode as that struck by the incident radiation.

The dynode potentials are usually provided by means of the circuit shown in Fig. 7.13.
Care must be taken to ensure that the voltage between the cathode and the first dynode is
large enough to maintain proportionality between cathode current and cathode illumination.
Usually a voltage value is recommended for a particular tube, and in some circumstances
(e.g. when examining fast pulses) it may be preferable to use a Zener diode in place of the
resistor between the cathode and the first dynode in Fig. 7.13 to keep the voltage at this value.

The intermediate stages usually operate satisfactorily over quite a wide voltage range
provided the voltage is distributed uniformly. To maintain this uniformity, the current flowing
through the dynode chain must be considerably larger (say 100 times) than the anode current.



FIG. 7.13 Photomultiplier dynode biasing circuit using a resistor chain. The EHT voltage is applied across a
resistor chain which acts as a potential divider and maintains the dynodes at increasingly higher
positive potentials relative to the cathode. When an amplified signal current pulse arrives at the
anode, it flows through the anode load resistor $R_L$ causing a voltage to appear across it.



If high anode currents are likely, then the last few stages may also be biased using Zener
diodes.

The photomultiplier responds to light input by delivering charge to the anode. This charge
may be allowed to flow through a resistor $R_L$ or to charge a capacitor: the corresponding
voltage signal then provides a measure of the input optical signal. If individual pulses need
to be examined (as, for example, when using photon counting techniques, which we discuss
in section 7.3.9), then it is important to ensure that the response time of the external circuitry
is less than that of the pulse rise time. This usually implies a low value for the load resistor.
Traditionally photomultipliers have been relatively bulky devices with photocathode
diameters of 25 mm or more. Recently much smaller devices have become available, contained
in small metal cans with photocathode diameters of 8 mm or so.

7.3.3.1 Speed of response

The electrons take a finite time (the electron transit time) to traverse the dynode chain from
cathode to anode, although in fact this time will not be identical for all the electrons. There
are two main reasons for the spread in transit times: first, the electrons have a spread in
velocities when they are ejected from the cathode and subsequent dynodes; secondly, they travel
slightly different paths (in different electric fields) through the photomultiplier. It is this spread
in transit times that limits the ability of the photomultiplier to respond faithfully to a fast optical
pulse. Transit time spread may be reduced by using fewer dynodes and designing these so
that electrostatic focusing gives the electrons very similar paths in nearly identical fields (as,
for example, in the linear focused type, shown in Fig. 7.12c). Overall tube gain may be
maintained when using fewer dynodes if the interdynode voltages are relatively high and if the
dynode surfaces have high secondary electron emission coefficients. Dynode surface
materials are now available which have much higher values of $\delta$ than that of the hitherto commonly
used Be-Cu. Photomultiplier tubes with rise times of about 2 ns and transit times of about
30 ns are now readily available. In terms of the structures illustrated in Fig. 7.12 the order
of increasing speed of response is: box and grid, Venetian blind, circular focused and linear
focused.



7.3.3.2 Noise in photomultipliers

All electrical systems suffer from noise, that is a randomly varying output signal unrelated to 
any input signal. In a photomultiplier there are several sources of noise which we now discuss. 

DARK CURRENT 

Even when no radiation is falling onto the photocathode surface, thermionic emission gives
rise to a 'dark current' which often constitutes the main source of noise in photoemissive
devices. The thermionic emission current $i_T$ for a cathode at temperature $T$ of area $A$ and
work function $\phi$ is given by the Richardson-Dushman equation (ref. 7.2)

$$i_T = a A T^2 \exp\left(-\frac{e\phi}{kT}\right) \tag{7.11}$$

Here $a$ is a constant which for pure metals has the value $1.2 \times 10^{6}$ A m$^{-2}$ K$^{-2}$. Thermionic
emission may be lowered by reducing the temperature and indeed this is often essential,
especially for low work function photocathodes, although other sources of noise may dominate
at low temperatures (e.g. electrons may be emitted from the photocathode by radioactive
bombardment). If the dark current were absolutely constant, it would be a relatively simple
matter to subtract it from the total output; however, it is itself subject to random
fluctuations due to the statistical nature of thermionic emission. The r.m.s. variation in the current
is given by the same equation as for shot noise, which is discussed next.

SHOT NOISE 

Shot noise is encountered whenever there is current flow and arises directly from the
discrete nature of the electronic charge. Thus when a current flows past any point in a circuit
the arrival rate of electrons will fluctuate slightly and give rise to fluctuations in the current
flow at that point. It may be shown (ref. 7.3) that the magnitude of the r.m.s. current
fluctuations $\Delta i_s$ with frequencies between $f$ and $f + \Delta f$ is given by

$$\Delta i_s = (2 i e \Delta f)^{1/2} \tag{7.12}$$

where $i$ is the current flowing, which in the case of the photomultiplier is the sum of the dark
current and the signal current leaving the photocathode.

The presence of dark-current shot noise sets a limit on the minimum detectable signal.
We assume that if an optical signal results in an electrical output signal that is smaller than
the noise signal, then it cannot be detected without further processing. We define the
responsivity $R_\lambda$ of the photomultiplier as $i/W$, where $W$ is the optical power falling on the
photocathode (see section 7.1). Therefore, the minimum detectable signal power in the presence
of a thermionic dark current $i_T$ is given by

$$W_{\min} = \frac{(2 i_T e \Delta f)^{1/2}}{R_\lambda} \tag{7.13}$$
EXAMPLE 7.3 Minimum detectable signal for a photomultiplier

To calculate the minimum signal power from eq. (7.13) we first need the thermionic emission
current, which may be estimated from eq. (7.11); thus if we assume a cathode of area
$10^{-3}$ m$^2$ with a work function $\phi$ of 1.25 eV and a cathode temperature of 300 K
(so that $kT = 0.025$ eV) we have

$$i_T = 1.2 \times 10^{6} \times 10^{-3} \times (300)^2 \exp(-1.25/0.025) = 2 \times 10^{-14}\ \mathrm{A}$$

Next we calculate the responsivity $R_\lambda$ using eq. (7.9). If we assume a quantum
efficiency $\eta$ of 0.25 at a wavelength of 0.5 µm we have

$$R_\lambda = \frac{\eta e \lambda}{hc} = \frac{0.25 \times 1.6 \times 10^{-19} \times 0.5 \times 10^{-6}}{6.6 \times 10^{-34} \times 3 \times 10^{8}} = 0.1\ \mathrm{A\,W^{-1}}$$

Finally, if we take a bandwidth of 1 Hz, then from eq. (7.13) we obtain

$$W_{\min} = \frac{(2 \times 2 \times 10^{-14} \times 1.6 \times 10^{-19} \times 1)^{1/2}}{0.1} = 8 \times 10^{-16}\ \mathrm{W}$$
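
The three steps of this example (dark current, responsivity, minimum detectable power) can be
chained together numerically. The sketch below is only illustrative: the function names are our
own, a cathode area of $10^{-3}$ m$^2$ is assumed, and the rounded value $kT = 0.025$ eV is passed
explicitly so that the result can be compared with the worked example. Because of the exponential
in eq. (7.11), using the unrounded value of $kT$ at 300 K changes the dark current several-fold.

```python
import math

K_B = 1.381e-23       # Boltzmann constant, J/K
E_CHARGE = 1.602e-19  # electronic charge, C
H = 6.626e-34         # Planck constant, J s
C = 2.998e8           # speed of light in vacuum, m/s
A_METAL = 1.2e6       # Richardson constant for pure metals, A m^-2 K^-2

def thermionic_current(area_m2, temp_k, work_function_ev, kt_ev=None):
    """Dark current from eq. (7.11); kt_ev overrides kT (in eV) if supplied."""
    if kt_ev is None:
        kt_ev = K_B * temp_k / E_CHARGE
    return A_METAL * area_m2 * temp_k ** 2 * math.exp(-work_function_ev / kt_ev)

def responsivity(quantum_efficiency, wavelength_m):
    """Responsivity R = eta * e * lambda / (h * c), in A/W."""
    return quantum_efficiency * E_CHARGE * wavelength_m / (H * C)

def min_detectable_power(dark_current_a, bandwidth_hz, resp_a_per_w):
    """Minimum detectable power from eq. (7.13)."""
    return math.sqrt(2.0 * dark_current_a * E_CHARGE * bandwidth_hz) / resp_a_per_w

i_dark = thermionic_current(1e-3, 300.0, 1.25, kt_ev=0.025)  # assumed area
r_lambda = responsivity(0.25, 0.5e-6)
w_min = min_detectable_power(i_dark, 1.0, r_lambda)
print(f"i_T = {i_dark:.1e} A, R = {r_lambda:.2f} A/W, W_min = {w_min:.1e} W")
```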






MULTIPLICATION NOISE 

It is found that the current noise at the anode is always greater than that expected from shot
noise alone (i.e. $G(2ie\Delta f)^{1/2}$). The reason for this is a statistical spread in the secondary
electron emission coefficient about the mean value $\delta$, which results in the noise being
increased by a factor $[\delta/(\delta - 1)]^{1/2}$ (ref. 7.4). This factor is only appreciable when $\delta$ is near
to unity; for a typical value of $\delta = 4$ the noise current is increased by some 15%.
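
The size of this correction for different secondary emission coefficients can be tabulated with a
few lines of code. The function name and the range of $\delta$ values below are our own choices,
made purely for illustration.

```python
import math

def multiplication_noise_factor(delta):
    """Factor [delta / (delta - 1)] ** 0.5 by which the anode noise is increased."""
    if delta <= 1:
        raise ValueError("delta must exceed unity for multiplication to occur")
    return math.sqrt(delta / (delta - 1.0))

for delta in (1.5, 2, 4, 8):
    factor = multiplication_noise_factor(delta)
    print(f"delta = {delta}: noise increased by {100 * (factor - 1):.0f}%")
```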

JOHNSON (OR NYQUIST) NOISE

Johnson noise arises because of the thermal agitation of charge carriers within a conductor;
the random nature of this motion results in a fluctuating voltage appearing across the
conductor. The r.m.s. value of this voltage, $\Delta V_J$, having frequency components between $f$ and
$f + \Delta f$, across a resistance $R$ at a temperature $T$ is given by (see ref. 7.5)

$$\Delta V_J = (4kTR\,\Delta f)^{1/2} \tag{7.14}$$

In a photomultiplier, such a noise voltage will appear across the anode load resistor ($R_L$ in
Fig. 7.13). In practice, Johnson noise is often smaller than the dark-current shot noise (see
Example 7.4).

EXAMPLE 7.4 Noise in photomultipliers

Suppose we take a photomultiplier which has a load resistor of $10^{3}$ Ω at 300 K and a
bandwidth $\Delta f$ of 1 kHz. Then from eq. (7.14)

$$\Delta V_J = (4 \times 1.38 \times 10^{-23} \times 300 \times 10^{3} \times 1 \times 10^{3})^{1/2} = 1.3 \times 10^{-7}\ \mathrm{V}$$

If the photomultiplier has a dark current of $10^{-14}$ A, then from eq. (7.12) there will be a shot
noise current of

$$\Delta i_s = (2 \times 10^{-14} \times 1.6 \times 10^{-19} \times 10^{3})^{1/2} = 1.8 \times 10^{-15}\ \mathrm{A}$$

If the photomultiplier gain is $10^{7}$ and if we ignore any multiplication noise contribution, the
shot noise voltage signal appearing across the load resistor is

$$\Delta V_s = 1.8 \times 10^{-15} \times 10^{7} \times 10^{3} = 1.8 \times 10^{-5}\ \mathrm{V}$$
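
The comparison between Johnson noise and amplified shot noise can be checked numerically from
eqs (7.12) and (7.14). The sketch below uses the quantities of Example 7.4 (a $10^{3}$ Ω load at
300 K, a 1 kHz bandwidth, a dark current of $10^{-14}$ A and a gain of $10^{7}$); the function
names are our own.

```python
import math

K_B = 1.381e-23       # Boltzmann constant, J/K
E_CHARGE = 1.602e-19  # electronic charge, C

def johnson_noise_voltage(resistance_ohm, temp_k, bandwidth_hz):
    """R.m.s. Johnson noise voltage from eq. (7.14)."""
    return math.sqrt(4.0 * K_B * temp_k * resistance_ohm * bandwidth_hz)

def shot_noise_current(current_a, bandwidth_hz):
    """R.m.s. shot noise current from eq. (7.12)."""
    return math.sqrt(2.0 * current_a * E_CHARGE * bandwidth_hz)

R_LOAD, TEMP, BANDWIDTH = 1e3, 300.0, 1e3
DARK_CURRENT, GAIN = 1e-14, 1e7

v_johnson = johnson_noise_voltage(R_LOAD, TEMP, BANDWIDTH)
i_shot = shot_noise_current(DARK_CURRENT, BANDWIDTH)
v_shot = i_shot * GAIN * R_LOAD  # shot noise referred to the load resistor

print(f"Johnson noise voltage = {v_johnson:.2e} V")
print(f"shot noise voltage    = {v_shot:.2e} V")
```

With these figures the amplified shot noise exceeds the Johnson noise by roughly two orders of
magnitude, which is why the latter can usually be neglected in photomultipliers.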



7.3.4 Image intensifiers 

As their name implies, image intensifiers are designed to boost very low intensity optical 
images to the point where they become useful. They can also act as wavelength 'down- 
converters', that is they can convert near-IR radiation into visible radiation. The primary 
image is formed on a photocathode surface and the resulting photoelectron current from each 
point on the image is then intensified either by increasing the energy of the individual
electrons (in the so-called first-generation types) or by increasing the actual numbers of
electrons (in second-generation types). The electrons subsequently fall onto a cathodoluminescent
phosphor screen to produce the intensified image. A schematic diagram of a typical first- 
generation type is shown in Fig. 7.14. 
FIG. 7.14 Schematic diagram of a first-generation image intensifier. Electrons are released from the
photocathode and accelerated towards the anode which is coated with a phosphor layer. Electrostatic
focusing elements ensure that electrons released from a certain spot on the photocathode are all focused
onto a corresponding spot on the phosphor screen. Both the photocathode and the phosphor screen are
curved, and fiber optical coupling can be used to convert flat images into curved images and vice versa.
On striking the phosphor screen, light is generated by cathodoluminescence.
The electrons may be focused onto the screen by either electrostatic or magnetic means;
usually the former is used for reasons of simplicity. An accelerating potential is applied
between the cathode and the phosphor screen, thus increasing the electron energy from a
few electron volts to 10 keV or so. Luminance gains of up to 2000 may be achieved quite
readily, while higher gains are possible by cascading two or more units with fiber optic
coupling between them.

Second-generation devices make use of the so-called microchannel plate to achieve gain
through electron multiplication. The microchannel plate consists of a slab of insulating
material (≈ 500 µm thick) with a high density of small diameter (≈ 15 µm) holes or channels in
it. The inner surfaces of the channels are made slightly conducting and a potential (≈ 1 kV)
is applied between opposite faces of the slab. Electrons entering one of the channels are
accelerated down it and strike the walls soon after entering, since the axis of the channel is slightly
inclined to the electron trajectory. As in the photomultiplier, secondary electrons are
generated by the impact and the process is repeated along the channel as illustrated in Fig. 7.15.
Focusing may be achieved most simply by placing the microchannel plate in close
proximity to both the photocathode and the phosphor screen; this arrangement gives a very
compact device. Alternatively, to achieve yet higher gain, the microchannel plate may be
placed just in front of the screen in a first-generation image intensifier. In some instances
an electronic output may be required rather than a visual one, in which case the
luminescent screen may be replaced by an array of electron detectors such as silicon diodes.

FIG. 7.15 Electron multiplication in a microchannel plate image intensifier.

A number of improvements to the basic second-generation scheme have resulted in what
are termed third-generation devices. Use is made of high efficiency GaAs photocathodes
which have peak spectral response in the 0.8-0.9 µm region where the night sky gives a
photon flux some five to seven times that at a wavelength of 0.5 µm. One of the problems
with the original microchannel plate is that positive ions generated within the
microchannels bombard the photocathode, eventually leading to its destruction. To counter this a thin
membrane of aluminium oxide of thickness 3-5 nm is placed over the entrance to the
microchannel plate which prevents any positive ions from reaching the photocathode. The
membrane is sufficiently thin, however, to allow electrons with an energy greater than 200 eV
to enter the channels. The newer plates also exhibit an improved spatial resolution by having
smaller diameter channels (8 µm) and a smaller centre-to-centre channel spacing (10 µm).
In addition the walls of the channels are coated with magnesium oxide, a secondary
electron emission enhancer which further increases the gain.

7.3.5 Photoconductive detectors 

As we showed in Chapter 2, an electron may be raised from the valence band to the
conduction band in a semiconductor where the energy gap is $E_g$ by the absorption of a photon
of frequency $\nu$ provided that

$$h\nu \ge E_g$$

or in terms of wavelength

$$\lambda \le hc/E_g \tag{7.15}$$

We define the band gap wavelength, $\lambda_g$, to be the largest value of wavelength that can cause
this transition, so that

$$\lambda_g = hc/E_g \tag{7.16}$$
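
For quick estimates it is convenient to evaluate eq. (7.16) with the energy gap in electron volts and
the wavelength in micrometres. The sketch below does this; the function name is our own, and the
energy gaps used (1.42 eV for GaAs and 1.12 eV for Si) are commonly quoted room-temperature values
assumed here rather than taken from the text.

```python
H = 6.626e-34         # Planck constant, J s
C = 2.998e8           # speed of light in vacuum, m/s
E_CHARGE = 1.602e-19  # joules per electron volt

def bandgap_wavelength_um(energy_gap_ev):
    """Band gap wavelength from eq. (7.16), lambda_g = h*c / E_g, in micrometres."""
    return H * C / (energy_gap_ev * E_CHARGE) * 1e6

# Assumed room-temperature energy gaps (illustrative values).
for name, gap_ev in (("GaAs", 1.42), ("Si", 1.12)):
    print(f"{name}: E_g = {gap_ev} eV -> lambda_g = {bandgap_wavelength_um(gap_ev):.2f} um")
```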

As long as the electron remains in the conduction band, the conductivity of the semicon- 
ductor will be increased. This is the phenomenon of photoconductivity, which is the basic 
mechanism operative in photoconductive detectors. For convenience, we suppose the semi- 
conductor material to be in the form of a slab of width W, length L and thickness D with 
electrodes on opposite ends, as shown in Fig. 7.16. An external potential across the elec- 
trodes is usually provided by the simple circuit of Fig. 7.17. 

Any increase in the conductivity of the detector results in an increased flow of current round
the circuit which will increase the potential across the load resistor $R_L$. This may then be
detected using a high impedance voltmeter. If we are only interested in the time-varying part
of the incident radiation, then a blocking capacitor $C$ may be inserted in the output line to
remove any d.c. component. The optimum size for $R_L$ in a particular situation is determined
by the fractional change in the resistance of the photodetector when under maximum
illumination. If this is small (say < 5%) then it may be shown (Problem 7.8) that the largest
sensitivity is obtained when $R_L = R_D$, where $R_D$ is the photodetector resistance. On the other hand,
if it is relatively large, then linearity of output can only be maintained if the potential drop
across the load resistor always remains small compared with the potential drop across the
photoconductor. This requires that $R_L \ll R_D$. It is obviously advantageous, as far as the output
voltage is concerned, that $R_D$ should have a high value.

We suppose that the radiation falling normally onto the slab is monochromatic and of
irradiance $I_0$. Not all of this incident energy will be available to generate electrons within the
FIG. 7.16 Geometry of slab of photoconductive material. The slab has length L, width W and thickness D,
with electrodes on opposite faces; radiation falls onto the face of length L and width W.

FIG. 7.17 Photoconductor bias circuit. The photoconductor is placed in series with a voltage
source and a load resistor $R_L$. Changes in the resistance of the
semiconductor cause changes in the voltage appearing across $R_L$.



semiconductor: some will be reflected from the front surface while some will pass through
the semiconductor without being absorbed at all.

The reflection coefficient (assuming no antireflection layer is present) is given by $r$, where

$$r = \left(\frac{n - 1}{n + 1}\right)^2$$

and $n$ is the refractive index of the semiconductor. The irradiance just inside the surface of
the slab is thus

$$I(0) = I_0(1 - r)$$

Now the irradiance at a point a distance $x$ into the semiconductor, $I(x)$, can be written as

$$I(x) = I(0)\exp(-\alpha x)$$
where $\alpha$ is the absorption coefficient of the material. For wavelengths longer than the bandgap
wavelength ($\lambda_g$), the absorption coefficient is comparatively small, whilst for wavelengths
below $\lambda_g$, $\alpha$ increases rapidly with decreasing wavelength, and can attain values in the region
of $10^{6}$ m$^{-1}$ or greater. Figure 7.18 shows the variation of $\alpha$ with wavelength for a number
of important semiconductor materials.

Values of $\alpha$ of the order of $10^{6}$ m$^{-1}$ imply that most electron-hole pairs will be generated
within a few micrometres of the semiconductor surface, although for wavelengths nearer
the band edge this figure may be considerably larger. The fraction of the incident irradiance
which is actually absorbed in the semiconductor can thus be written $(1 - r) \times \eta_{\mathrm{abs}}$, where

$$\eta_{\mathrm{abs}} = 1 - \exp(-\alpha D) \tag{7.17}$$



If we write $\eta = (1 - r) \times \eta_{\mathrm{abs}}$, the total number of electron-hole pairs generated within the slab
per second is $\eta I_0 WL/h\nu$. The average generation rate $r_g$ of carriers per unit volume is then
given by

$$r_g = \frac{\eta I_0 W L}{h\nu\, W L D}$$

or

$$r_g = \frac{\eta I_0}{h\nu D} \tag{7.18}$$

FIG. 7.18 Variation of the optical absorption coefficient $\alpha$ (in m$^{-1}$) with wavelength (in µm) for several
semiconductor materials.
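
To see how these quantities combine, the sketch below evaluates eqs (7.17) and (7.18) for one set of
conditions. All the numerical inputs (an absorption coefficient of $10^{6}$ m$^{-1}$, a 5 µm thick
slab, a reflection coefficient of 0.3, an irradiance of 10 W m$^{-2}$ and a wavelength of 0.8 µm)
are illustrative assumptions, as are the function names.

```python
import math

H = 6.626e-34  # Planck constant, J s
C = 2.998e8    # speed of light in vacuum, m/s

def absorbed_fraction(alpha_per_m, thickness_m):
    """eta_abs from eq. (7.17): fraction of entering light absorbed in the slab."""
    return 1.0 - math.exp(-alpha_per_m * thickness_m)

def generation_rate(irradiance_w_m2, wavelength_m, reflectance,
                    alpha_per_m, thickness_m):
    """Average generation rate per unit volume from eq. (7.18), in m^-3 s^-1."""
    eta = (1.0 - reflectance) * absorbed_fraction(alpha_per_m, thickness_m)
    photon_energy_j = H * C / wavelength_m
    return eta * irradiance_w_m2 / (photon_energy_j * thickness_m)

# Hypothetical operating conditions, chosen only for illustration.
eta_abs = absorbed_fraction(1e6, 5e-6)
r_g = generation_rate(10.0, 0.8e-6, 0.3, 1e6, 5e-6)
print(f"eta_abs = {eta_abs:.4f}, r_g = {r_g:.2e} m^-3 s^-1")
```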

As we saw in section 2.7 the recombination rate $r_r$ of excess carriers depends on the
densities of the excess carrier populations $\Delta n$, $\Delta p$ (where for charge neutrality $\Delta n = \Delta p$) and
the minority carrier lifetime $\tau_c$ via the equation

$$r_r = \Delta n/\tau_c = \Delta p/\tau_c \tag{7.19}$$

In equilibrium the recombination rate must equal the generation rate, and therefore

$$\Delta n = \Delta p = r_g \tau_c \tag{7.20}$$
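We may write the conductivity $\sigma$ of a semiconductor material as (see eq. 2.24)

$$\sigma = n e \mu_e + p e \mu_h$$

where $n$ and $p$ are the electron and hole concentrations and $\mu_e$ and $\mu_h$ are the corresponding
mobilities. Hence under illumination the dark conductivity will increase by an amount $\Delta\sigma$ where

$$\Delta\sigma = \Delta n\, e \mu_e + \Delta p\, e \mu_h = r_g \tau_c e (\mu_e + \mu_h) \tag{7.21}$$

When a voltage $V$ is applied across the electrodes, the current through the slab therefore
increases by an amount

$$\Delta i = \frac{WD}{L}\,\Delta\sigma\,V$$

Using eq. (7.21) we then obtain

$$\Delta i = \frac{WD}{L}\, r_g \tau_c e (\mu_e + \mu_h) V \tag{7.22}$$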

We may define an effective quantum efficiency parameter, known as the photoconductive gain G, as the ratio of the rate of flow of electrons per second from the device to the rate of generation of electron-hole pairs within the device. That is,

$$G = \frac{\Delta i}{e}\,\frac{1}{r_g W L D}$$

or, using eq. (7.22),

$$G = \frac{\tau_c(\mu_e + \mu_h)V}{L^2} \qquad (7.23)$$

Unlike the quantum efficiency for, say, the photoelectric effect, G may be larger than unity. It may be increased by increasing V and decreasing L, although at high values of the electric field the current tends to saturate owing to space charge effects. High values of the gain will also be favoured by large values of $\tau_c$, although this implies that the response time will be correspondingly poor (see Problem 7.9). There is a good deal of evidence that in some materials (such as CdS) impurity energy levels exist within the energy gap into which carriers may fall but which do not cause recombination; the carrier is merely released by thermal excitation at some later time. Such levels are termed traps or sensitization centres. Whilst a carrier is held in a trap, a carrier of the opposite type must be present in the semiconductor to maintain charge neutrality; thus the presence of traps further increases the gain. This increase in gain, however, will once again be at the expense of the response time. Under fairly intense background illumination levels and at relatively elevated temperatures, the traps in most materials tend to be full; consequently they have little effect on the photosignal. Significant effects, however, may be observed at low temperatures and low background illumination levels.
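The dependence of the gain on bias voltage and electrode spacing in eq. (7.23) can be made concrete with a short numerical sketch. The material parameters below (lifetime, mobilities, bias and spacing) are hypothetical values chosen only for illustration; they are not taken from the text.

```python
import math


def photoconductive_gain(tau_c, mu_e, mu_h, voltage, spacing):
    """Photoconductive gain G = tau_c (mu_e + mu_h) V / L^2 (eq. 7.23), SI units."""
    return tau_c * (mu_e + mu_h) * voltage / spacing**2


# Hypothetical illustrative values (assumptions, not data from the text).
TAU_C = 1e-3      # minority carrier lifetime, s
MU_E = 0.03       # electron mobility, m^2 V^-1 s^-1
MU_H = 0.005      # hole mobility, m^2 V^-1 s^-1
BIAS = 10.0       # applied voltage, V
SPACING = 1e-3    # electrode spacing L, m

gain = photoconductive_gain(TAU_C, MU_E, MU_H, BIAS, SPACING)
gain_half_spacing = photoconductive_gain(TAU_C, MU_E, MU_H, BIAS, SPACING / 2)

print(f"G = {gain:.3g}")
print(f"G with half the electrode spacing = {gain_half_spacing:.3g}")
```

Halving L quadruples G, which is why narrow interelectrode gaps are attractive; the price, as noted above, is that a long lifetime (large gain) also means a slow response.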

Figure 7.19 shows an idealized wavelength response curve for a photoconductive detector assuming a constant quantum efficiency η for carrier generation when $\lambda < \lambda_g$. In practice, η is found to decrease at short wavelengths; this is a consequence of the increase in the absorption coefficient with decreasing wavelength (Fig. 7.18). Carriers are then being generated increasingly closer to the semiconductor surface where there is often a much higher probability of radiationless transitions taking place than in the bulk. Careful surface passivation techniques are needed to ensure good short wavelength response characteristics.

7.3.5.1 Noise in photoconductive detectors

The main source of noise in photoconductive detectors arises from fluctuations in the rates of generation and recombination of electron-hole pairs, and is termed generation-recombination noise. Both optical and thermal excitation processes contribute to generation noise. The relative importance of the thermal process is strongly dependent on the size of the bandgap, since the probability for thermal excitation of a carrier across the gap is approximately proportional to the factor $\exp(-E_g/2kT)$ (see the discussion following eq. 2.36). Thus for detectors capable of operating out to relatively long wavelengths, and which consequently have small energy gaps, thermal generation noise is likely to be large unless the temperature is reduced. As a rough rule of thumb, the operating temperature should be such that $T < E_g/25k$. Provided that the thermal generation noise has been reduced to negligible proportions, the r.m.s. noise current fluctuations $\Delta i_{\mathrm{g\text{-}r}}$ within the frequency range f to f + Δf resulting from generation-recombination are given by (ref. 7.6)

$$\Delta i_{\mathrm{g\text{-}r}}(f) = \left(\frac{4 i e G\,\Delta f}{1 + 4\pi^2 f^2 \tau_c^2}\right)^{1/2} \qquad (7.24)$$

Here G is the photoconductive gain, $\tau_c$ the minority carrier lifetime and i the total current.

The noise current given by eq. (7.24) is almost independent of frequency when $f \ll 1/(2\pi\tau_c)$; when $f > 1/(2\pi\tau_c)$ the noise current declines with increasing frequency. At sufficiently high frequencies, the dominant mechanism will be Johnson noise. The total noise present over a particular range of frequencies may be obtained by integration of eq. (7.24) (see the problems at the end of the chapter).

FIG. 7.19 Idealized wavelength response curve for a photoconductive detector (output per unit energy input against wavelength). Beyond $\lambda_g$ the output falls to zero since the photons have insufficient energy to excite carriers across the bandgap.

It is also found that at frequencies less than about 1 kHz a relatively little understood source of noise known as flicker or 1/f noise becomes predominant. Flicker noise is present in most semiconductor devices and, although the cause has not been established with certainty, there appear to be some definite links with trap distributions that are metastable at the device operating temperature. Empirically it is found that the r.m.s. noise current due to flicker noise may be written as

$$\Delta i_f = i\left(\frac{B\,\Delta f}{f}\right)^{1/2}$$

where B is a constant for a particular situation (≈ $10^{-11}$). A complete noise spectrum for the photoconductive detector is shown schematically in Fig. 7.20.
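The frequency dependence of generation-recombination noise can be explored numerically. The sketch below assumes the standard form of the generation-recombination noise current, $[4ieG\Delta f/(1 + 4\pi^2 f^2\tau_c^2)]^{1/2}$, and uses hypothetical detector values; the function names are illustrative only.

```python
import math

E_CHARGE = 1.602e-19  # electronic charge, C


def gr_noise_current(i_total, gain, tau_c, freq, delta_f=1.0):
    """r.m.s. generation-recombination noise current in bandwidth delta_f (A)."""
    rolloff = 1.0 + (2.0 * math.pi * freq * tau_c) ** 2
    return math.sqrt(4.0 * E_CHARGE * i_total * gain * delta_f / rolloff)


def gr_corner_frequency(tau_c):
    """Frequency 1/(2 pi tau_c) above which the g-r noise starts to fall."""
    return 1.0 / (2.0 * math.pi * tau_c)


# Hypothetical detector values (assumptions for illustration only).
I_TOTAL = 1e-6   # total current, A
GAIN = 100.0     # photoconductive gain
TAU_C = 1e-3     # minority carrier lifetime, s

f_corner = gr_corner_frequency(TAU_C)
noise_low = gr_noise_current(I_TOTAL, GAIN, TAU_C, 0.0)
noise_corner = gr_noise_current(I_TOTAL, GAIN, TAU_C, f_corner)
noise_high = gr_noise_current(I_TOTAL, GAIN, TAU_C, 100.0 * f_corner)

print(f"corner frequency = {f_corner:.1f} Hz")
print(f"noise at low frequency = {noise_low:.3e} A per root Hz")
print(f"noise at 100 x corner  = {noise_high:.3e} A per root Hz")
```

At the corner frequency the noise current has dropped by a factor of √2 from its flat low-frequency value, and well above it the current falls as 1/f, which is the behaviour sketched in the noise spectrum.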

7.3.5.2 Characteristics of particular photoconductive materials 

Cadmium sulfide (CdS) and cadmium selenide (CdSe) are both used for low cost visible radiation sensors, for example in light meters for cameras. These devices usually have high photoconductive gains (some $10^3$ to $10^4$) but poor response times (about 50 ms). The response time is in fact strongly dependent on the illumination level, being much reduced at high levels, behaviour indicative of the presence of traps. A typical construction is shown in Fig. 7.21. A thin film of the material in polycrystalline form is deposited on an insulating substrate and the electrodes are formed by evaporating a suitable metal, such as gold, through a mask to give the comb-like pattern shown. This geometry, which results in a relatively large area of sensitive surface and a small interelectrode spacing, helps to give the device a high sensitivity.

Lead sulfide (PbS) is a well-known near-IR detector material with a useful wavelength response from 1 µm to 3.4 µm. By varying the growth conditions, the detector characteristics can be varied widely with the expected trade-off between gain and frequency response. A typical response time would be about 200 µs. In the wavelength region of 2 µm it is one of the most sensitive photodetectors. The wavelength response may be extended to 4 µm by cooling to -30°C (unlike most other photoconductive materials, the energy gap of PbS decreases with decreasing temperature). However, cooling also has the effect of reducing both the overall gain and the frequency response. The internal dark resistance of these detectors is usually quite high (≈ 1 MΩ). This is an advantage when using the bias circuit of Fig. 7.17, since then comparatively large values for the bias resistor $R_L$ can be used, resulting in relatively high output voltage signals.

FIG. 7.20 Schematic diagram of the noise spectrum for a photoconductive detector. The frequency $f_1$ below which '1/f' noise becomes predominant is about 1 kHz. Generation-recombination noise has a flat spectrum at frequencies below $1/(2\pi\tau_c)$ $(= f_2)$ where $\tau_c$ is the minority carrier lifetime. Above $f_2$ generation-recombination noise falls with increasing frequency as 1/f. At the highest frequencies (~1 MHz) Johnson noise associated with the circuit load resistor will eventually become predominant.

FIG. 7.21 Top view of typical electrode geometry of a CdS photoconductive cell (CdS film with evaporated metal electrodes).

Indium gallium arsenide ($\mathrm{In}_x\mathrm{Ga}_{1-x}\mathrm{As}$, 0 < x < 1), the ternary alloy of InAs and GaAs, has proved a very useful detector material for the region 1 µm to 1.6 µm. It may be readily grown on InP substrates provided the two materials are lattice matched, which occurs when x = 0.53.

Indium antimonide (InSb) detectors have low impedances (≈ 50 Ω). Consequently, their output voltage signals are small. They have a wavelength response extending out to 7 µm. Operation at room temperature is possible, but a much improved noise performance results on cooling to liquid nitrogen temperatures (77 K), although the peak wavelength response then shifts to 5 µm.

Mercury cadmium telluride ($\mathrm{Hg}_x\mathrm{Cd}_{1-x}\mathrm{Te}$) may be regarded as an alloy of the semimetal HgTe and the semiconductor CdTe. Semimetals have overlapping valence and conduction bands and may be regarded as having negative energy gaps; in HgTe the bandgap is -0.1 eV. Consequently, depending on the composition of the alloy, a semiconductor can be formed with a bandgap varying between zero and that of pure CdTe. Detectors are available whose peak sensitivities lie at wavelengths [...]. This region is of particular importance, since it covers the peak emission wavelengths of bodies at or somewhat above ambient temperatures, and also a region of good atmospheric transmission (see Fig. 1.8). Usually cooling is required to give a satisfactory noise performance, but detectors operating at the lower end of this wavelength range [...]

DOPED SEMICONDUCTORS

Rather than using band-to-band transitions it is also possible to use transitions from impurity levels within the bandgap to the appropriate band edge [...] boron-doped germanium detectors whose wavelength response [...], although cooling to 4 K (liquid helium temperature) is required.

7.3.5.3 Vidicons and plumbicons

The vidicon is a generic name for a family of devices that relies on the phenomenon of photoconductivity to convert an optical image into an electrical signal. Figure 7.22 shows a typical structure. The optical image is formed on a thin target of semiconducting material (antimony trisulfide is commonly used) that has a transparent conducting layer (SnO₂) on the side facing the incident radiation. This conducting layer is connected to a potential of some +50 V above ground via a bias resistor. The other side of the semiconductor target is scanned with an electron beam in the same way as in a CRT. In operation, the target acts rather like a 'leaky' capacitor. In the dark, the resistivity of the target is high, and charge will accumulate on opposite faces. The electron beam side will charge up to around cathode potential (0 V), whilst the other will charge up to around +50 V. Under illumination, however, the resistivity of the target material will be much reduced and the charge on the 'capacitor' will leak away (i.e. the capacitor will discharge itself) whenever the scanning beam is not incident on the area in question. When the beam does return to the 'discharged' area it will recharge the beam side and a corresponding amount of opposite charge must be supplied via the bias resistor and external bias supply circuit to the other side of the target. The amount of charge flowing will be dependent on how discharged the 'capacitor' has become, which, in turn, is directly related to the amount of light falling on the target. The output voltage signal is obtained by taking the voltage across the bias resistor.

FIG. 7.22 Basic construction of a vidicon tube.

One of the problems with the vidicon is its relatively high dark current, which gives rise to poor S/N ratios at low light irradiances. A device which exhibits very low dark currents is the plumbicon. This is essentially identical to the vidicon except for the nature of the photosensitive layer. In the plumbicon, this consists of a thin film p-i-n structure formed from lead oxide, PbO (Fig. 7.23a). The transparent SnO₂ layer acts as the n-type contact while the other surface has an excess of oxygen, which causes it to be p-type. The region in between (typically 15 µm thick) is effectively an intrinsic semiconductor. With no illumination, the potentials across the device cause it to be in reverse bias (and hence very little dark current will flow). Any illumination of the film generates electron-hole pairs, which will then flow to opposite sides of the structure and reduce the amount of stored charge. As in the vidicon, when the electron beam recharges the beam side a corresponding amount of charge is drawn through the bias resistor from the external supply. The plumbicon is widely used in colour TV studio cameras. The energy gap of PbO is about 2 eV so that the red sensitivity of the device is poor; this can be improved by adding a thin layer of PbS ($E_g$ ≈ 0.4 eV) to the electron beam side of the target. Any red light not absorbed by the PbO is then absorbed by the PbS.

A further development has been the replacement of the lead oxide target layer by an array of silicon diodes (Fig. 7.23b). Dark currents are very small, and the devices exhibit very uniform sensitivities over the wavelength range 0.45-0.85 µm with a good tolerance to high light levels.

FIG. 7.23 Target structures for vidicon-type imaging tubes using (a) a lead-oxide-based p-i-n structure (the plumbicon) and (b) a discrete array of silicon p-n junction diodes.

7.3.5.4 Multiple quantum well detectors

The introduction of techniques for fabricating multiple quantum well (MQW) structures (see section 2.9) has led to the development of some very interesting detectors based on them which operate in the far-IR region. Within a quantum well an optically induced transition can take place between the sub-bands provided that the transition is 'vertical' on the E-k diagram (see section 4.6.1.1 and Fig. 4.13). That is, in the notation of section 2.9, a transition can take place between states of differing $n_3$ only if they have the same values of $n_1$ and $n_2$ (Fig. 7.24a). If we take an MQW structure and apply a field then the energy bands become 'tilted' and it is then possible for an electron in an excited state to tunnel through the potential barriers (Fig. 7.24b) and hence give rise to a current flow. A slightly different device can be made by using a well so narrow that only one sub-band is present within the well. Direct absorption can now take place between this state and the continuum states which lie above the top of the quantum well (Fig. 7.24c).

The sorts of transitions described here cover wavelengths from about 5 µm upwards (see Example 7.5). Although this range is already covered by the mercury cadmium telluride photoconductor there are considerable processing problems associated with making devices from this material.

FIG. 7.24 (a) Electron states in a quantum well (no applied field); the dimensions are such that only two levels are present (quantum numbers 1 and 2). With an applied field we may observe photoconductivity due to (b) transitions between the two levels followed by interwell tunnelling or (c) transitions to the electron continuum of states.



EXAMPLE 7.5 Detection wavelengths in quantum well detectors

According to eq. (2.64) the energy difference between the $n_3 = 1$ and $n_3 = 2$ states in a quantum well ($n_1$ and $n_2$ remaining constant) is given by

$$\Delta E = \frac{h^2}{8 m^*}\left(\frac{2^2}{L_z^2} - \frac{1^2}{L_z^2}\right)$$

Here we have replaced m by m* to allow for the fact that the well is within a semiconductor material and is not simply in a vacuum. Assuming a well 10 nm wide ($= L_z$) of GaAs (where m* = 0.068 m) we then obtain ΔE = 0.165 eV. The corresponding optical wavelength is then given by hc/ΔE, that is 7.5 µm.
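The arithmetic of Example 7.5 can be checked with a few lines of code. This is a minimal sketch assuming an infinitely deep well, so that the level spacing follows the simple particle-in-a-box expression; the function name and rounded physical constants are illustrative choices.

```python
H_PLANCK = 6.626e-34    # Planck constant, J s
M_ELECTRON = 9.109e-31  # free electron mass, kg
E_CHARGE = 1.602e-19    # electronic charge, C
C_LIGHT = 2.998e8       # speed of light, m/s


def well_transition_energy(width, m_eff_ratio, n_low=1, n_high=2):
    """Energy gap (J) between two levels of an infinitely deep well of given width."""
    m_eff = m_eff_ratio * M_ELECTRON
    return H_PLANCK**2 * (n_high**2 - n_low**2) / (8.0 * m_eff * width**2)


# Values quoted in Example 7.5: a 10 nm GaAs well with m* = 0.068 m.
delta_e = well_transition_energy(width=10e-9, m_eff_ratio=0.068)
delta_e_ev = delta_e / E_CHARGE
wavelength = H_PLANCK * C_LIGHT / delta_e

print(f"Delta E = {delta_e_ev:.3f} eV")
print(f"wavelength = {wavelength * 1e6:.2f} um")
```

Because ΔE scales as $1/L_z^2$, the detection wavelength scales as $L_z^2$, so modest changes in well width shift the response appreciably.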



7.3.6 Junction detectors 

7.3.6.1 The p-n junction detector

When a p-n junction is formed in a semiconductor material, a region depleted of mobile charge carriers is created which has a high internal electric field across it (see the discussion in section 2.8.1). If an electron-hole pair is generated by photon absorption within this region, then the internal electric field separates the electron and hole, as illustrated in Fig. 7.25.

We may detect the charge separation in three distinct ways. First, if the device is left on open circuit (or with a very high resistance between the external contacts) an externally measurable potential will appear between the p and n regions; this is the photovoltaic mode of operation. Secondly, in the photoamperic mode a very low resistance (or short circuit) is connected between the external contacts and a photogenerated current flows. Finally, the most usual way to operate the device is in the photoconductive mode, where a reverse bias is applied across the junction and the resulting current flow through an external load resistor is measured. The load resistor in this case need not be as small as in the photoamperic mode.

The junction will also respond to electron-hole pairs generated away from the depletion region provided they are able to diffuse to the edge of the depletion region before recombination takes place. From the discussion in Chapter 2, it is evident that only carriers generated within a minority carrier diffusion length or so of the edge of the depletion region are likely to be able to do this; nevertheless, this does increase the sensitive volume of the detector.

In operation, we may represent the photodiode by a constant current generator (the current being generated by light absorption) with an ideal diode across it (to represent the effect of the p-n junction), as shown in Fig. 7.26. The internal characteristics of the cell may be better modelled by the introduction of a shunt resistor ($R_\mathrm{sh}$), a shunt capacitor ($C_\mathrm{j}$) and a series resistor ($R_\mathrm{s}$). If we assume that a fraction η of the incident photons generates current within the cell, then we may write

$$i_\lambda = \frac{\eta I_0 A e \lambda_0}{hc} \qquad (7.25)$$

where $I_0$ is the light irradiance falling on a cell of area A.




FIG. 7.25 Electron energy level diagram illustrating the generation and subsequent separation of an electron-hole pair by photon absorption within the depletion region of a p-n junction.

We now develop expressions for the sensitivities of the photodiode when operated in the photovoltaic and photoconductive modes. For simplicity, we assume that operation is at fairly low optical modulation frequencies, so that the effects of the shunt capacitance may be neglected (i.e. $i_\mathrm{c} = 0$).

Considering the remaining current flows as shown in Fig. 7.26, we have

$$i_\lambda = i_\mathrm{d} + i_\mathrm{sh} + i_\mathrm{ext} \qquad (7.26)$$

also

$$V_\mathrm{ext} = V_\mathrm{d} - i_\mathrm{ext} R_\mathrm{s} \qquad (7.27)$$

and

$$V_\mathrm{d} = i_\mathrm{sh} R_\mathrm{sh} \qquad (7.28)$$

Since the diode is assumed to be 'ideal' we may take its current-voltage behaviour to be given by eq. (2.51), that is

$$i_\mathrm{d} = i_0\left[\exp\left(\frac{eV_\mathrm{d}}{kT}\right) - 1\right]$$

where $i_0$ is the diode reverse bias leakage current.

In the photovoltaic mode, the external resistor ($R_L$) is made sufficiently large so that the current flowing through it, $i_\mathrm{ext}$, is small compared with the photogenerated current, $i_\lambda$. In addition the value of the shunt resistor is usually sufficiently large that the shunt current $i_\mathrm{sh}$ is also much smaller than $i_\lambda$. Thus it is a reasonable approximation to assume that in eqs (7.26) and (7.27) we can put $i_\mathrm{ext} = i_\mathrm{sh} = 0$, and hence these equations become $i_\lambda = i_\mathrm{d}$ and $V_\mathrm{ext} = V_\mathrm{d}$ respectively. Substituting for $i_\mathrm{d}$ in the former we obtain

$$i_\lambda = i_0\left[\exp\left(\frac{eV_\mathrm{ext}}{kT}\right) - 1\right]$$

Rearranging gives

$$\exp\left(\frac{eV_\mathrm{ext}}{kT}\right) = \frac{i_\lambda}{i_0} + 1$$

Under normal operating conditions $i_\lambda \gg i_0$, so that we may write

$$\exp\left(\frac{eV_\mathrm{ext}}{kT}\right) \approx \frac{i_\lambda}{i_0}$$

It then follows that

$$V_\mathrm{ext} = \frac{kT}{e}\ln\left(\frac{i_\lambda}{i_0}\right)$$

Substituting for $i_\lambda$ from eq. (7.25) we have

$$V_\mathrm{ext} = \frac{kT}{e}\ln\left(\frac{\eta I_0 e \lambda_0 A}{i_0 h c}\right)$$

or

$$V_\mathrm{ext} = \frac{kT}{e}\ln\left(\frac{\eta e \lambda_0 A}{i_0 h c}\right) + \frac{kT}{e}\ln I_0 \qquad (7.29)$$

Hence the external voltage depends on the logarithm of the incident light irradiance.

FIG. 7.26 Photodiode equivalent circuit. In operation, the photodiode can be represented by a photogenerated current source $i_\lambda$ feeding into an ideal diode. The internal cell characteristics are better modelled by the introduction of a shunt resistor ($R_\mathrm{sh}$), a shunt capacitor ($C_\mathrm{j}$) and a series resistor ($R_\mathrm{s}$). In the photovoltaic mode, an external high value resistor ($R_L$) is connected across the output and the voltage across this measured. In the photoconductive mode, an external bias ($V_\mathrm{bias}$) is applied in conjunction with a series load resistor ($R_L$). The current flowing through $R_L$ is monitored by measuring the voltage across it.

In the photoconductive mode, a relatively large reverse bias is applied to the diode. Since the diode current saturates at $i_0$ for quite small values of reverse bias (i.e. a few tenths of a volt), eq. (7.26) can be written as

$$i_\lambda = i_0 + i_\mathrm{sh} + i_\mathrm{ext}$$

As in the photovoltaic mode we may assume that both $i_0$ and $i_\mathrm{sh}$ are much less than $i_\lambda$ and hence we may write $i_\mathrm{ext} = i_\lambda$. Substituting from eq. (7.25) we then have

$$i_\mathrm{ext} = \frac{\eta I_0 A e \lambda_0}{hc} \qquad (7.30)$$

With an external load resistor of $R_L$, the output voltage will then be

$$V_\mathrm{ext} = \frac{\eta I_0 A e \lambda_0 R_L}{hc} \qquad (7.30\mathrm{a})$$

Hence, in the photoconductive mode the external current flowing is directly proportional to the incident light irradiance.

Finally, in the photoamperic mode, the situation is somewhat similar to that in the photoconductive mode: provided the external resistance is sufficiently small, almost all of the photogenerated current will flow through it and hence the output current and voltage will be the same as eqs (7.30) and (7.30a) respectively. However, if the external voltage ($= R_L \times i_\mathrm{ext}$) becomes too large, the internal diode may become sufficiently forward biased to divert an appreciable amount of current from the external circuit; this will lead to a non-linear response.

The photoconductive mode thus offers the advantage of an inherently linear response over a relatively wide dynamic range. It also offers a more rapid response than the photovoltaic mode. The main drawback is the presence of a dark current ($i_0 + i_\mathrm{sh}$) which, as in the photomultiplier, gives rise to shot noise (section 7.3.3.2) and limits the ultimate sensitivity of the device. All modes of operation are subject to generation noise (section 7.3.5.1), but recombination noise is absent since the charge carriers are separated in the depletion region before they can recombine.
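The contrast between the logarithmic photovoltaic response and the linear photoconductive response can be illustrated numerically. The sketch below assumes the idealized model of this section (shunt and series resistances ignored, photogenerated current proportional to irradiance); the quantum efficiency, detector area, leakage current and load resistance are hypothetical values chosen for illustration.

```python
import math

H_PLANCK = 6.626e-34   # Planck constant, J s
C_LIGHT = 2.998e8      # speed of light, m/s
E_CHARGE = 1.602e-19   # electronic charge, C
K_BOLTZ = 1.381e-23    # Boltzmann constant, J/K


def photocurrent(eta, irradiance, area, wavelength):
    """Photogenerated current eta * I0 * A * e * lambda0 / (h c), in amperes."""
    return eta * irradiance * area * E_CHARGE * wavelength / (H_PLANCK * C_LIGHT)


def photovoltaic_voltage(i_photo, i_leak, temperature=300.0):
    """Open-circuit voltage (kT/e) ln(i_photo / i_leak + 1) of the ideal diode."""
    return (K_BOLTZ * temperature / E_CHARGE) * math.log(i_photo / i_leak + 1.0)


def photoconductive_voltage(i_photo, r_load):
    """Voltage across the load resistor when all the photocurrent flows through it."""
    return i_photo * r_load


# Hypothetical detector parameters (assumptions for illustration).
ETA, AREA, WAVELENGTH = 0.8, 1e-6, 0.85e-6   # efficiency, m^2, m
I_LEAK, R_LOAD = 1e-9, 1e4                   # A, ohm

for irradiance in (1.0, 2.0, 4.0):           # W m^-2
    i_ph = photocurrent(ETA, irradiance, AREA, WAVELENGTH)
    v_pv = photovoltaic_voltage(i_ph, I_LEAK)
    v_pc = photoconductive_voltage(i_ph, R_LOAD)
    print(f"I0 = {irradiance} W/m^2: photovoltaic {v_pv:.4f} V, photoconductive {v_pc:.4e} V")
```

Each doubling of the irradiance doubles the photoconductive output but adds only about (kT/e) ln 2, roughly 18 mV at room temperature, to the photovoltaic output.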

A typical structure for a p-n diode junction detector based on silicon is shown in Fig. 7.27. Contact to the semiconductor material is made via a metal-n⁺ (or -p⁺) junction; this is found to be the most convenient way of providing an ohmic contact (see section 2.8.6). If we assume an abrupt junction together with the external bias, V, being much larger than the internal junction potential, $V_0$, and also that $N_a \gg N_d$, then the depletion layer widths can be written as (see eqs 2.60, 2.59 and 2.55)

$$x_n = \left(\frac{2\varepsilon_0\varepsilon_r V}{e N_d}\right)^{1/2} \quad\text{and}\quad x_p = \left(\frac{2\varepsilon_0\varepsilon_r V N_d}{e N_a^2}\right)^{1/2} \qquad (7.31)$$

Since we have a p⁺-n structure it follows that $x_n \gg x_p$; Fig. 7.28 illustrates the resulting electric field variation within the depletion regions. For efficient detection the electron-hole pairs should be generated within the depletion region. At short wavelengths, where the absorption coefficient is relatively high (Fig. 7.18), they will be generated close to the surface. Consequently, to achieve a good short wavelength response the p⁺ region should be made as thin as possible. Conversely, at the upper wavelength range of the detector the absorption coefficient will be relatively small and a wide depletion region is required for high detection efficiency. This implies that large values of the reverse bias voltage are needed (see Example 7.6), which may approach or exceed the diode breakdown voltage. Detection efficiency may also be improved by providing an antireflection coating on the front surface of the detector, consisting of a λ/4 thick coating of SiO₂.

FIG. 7.27 Typical silicon photodiode structure for photoconductive operation. A junction is formed between heavily doped p-type material (p⁺) and fairly lightly doped n-type material so that the depletion region extends well into the n material. The p⁺ layer is made fairly thin. Metallic contacts can be made directly to the p⁺ material but to obtain an ohmic contact to the n material an intermediate n⁺ layer must be formed.

FIG. 7.28 Electric field variation within the p⁺-n junction diode shown in Fig. 7.27, assuming an abrupt junction.

EXAMPLE 7.6 Depletion region thickness in Si p⁺-n photodiodes

If we take a silicon diode with a moderately doped n region, that is $N_d = 5 \times 10^{21}$ m⁻³, and an applied voltage of 100 V, eq. (7.31) gives $x_n = 5.1 \times 10^{-6}$ m, that is 5.1 µm. Inspection of Fig. 7.18 shows that at 0.8 µm the absorption coefficient of silicon is about $10^5$ m⁻¹, so that, ignoring Fresnel reflection, the fraction of the incident radiation absorbed by a 5.1 µm thick layer is $1 - \exp(-5.1 \times 10^{-6} \times 10^5)$, that is 0.4. This is obviously insufficient if a high efficiency photodiode is required at this wavelength. In fact to absorb 80% of the radiation would require the depletion layer to be about 20 µm wide. To achieve this with the doping levels considered here would require an applied voltage of approximately 100 × (20/5)² V, or 1600 V. In fact electrical breakdown would occur long before a voltage as high as this could be applied.
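The numbers in Example 7.6 are easy to reproduce. The following sketch assumes a relative permittivity of 11.8 for silicon (a value not stated in the example) and uses the abrupt-junction expression for $x_n$ from eq. (7.31); the function names are illustrative.

```python
import math

EPS_0 = 8.854e-12     # permittivity of free space, F/m
E_CHARGE = 1.602e-19  # electronic charge, C
EPS_R_SI = 11.8       # relative permittivity of silicon (assumed value)


def depletion_width_n(voltage, donor_density, eps_r=EPS_R_SI):
    """Depletion width on the n side of an abrupt p+-n junction, in metres."""
    return math.sqrt(2.0 * EPS_0 * eps_r * voltage / (E_CHARGE * donor_density))


def absorbed_fraction(alpha, thickness):
    """Fraction of entering radiation absorbed in a layer, ignoring reflection."""
    return 1.0 - math.exp(-alpha * thickness)


def thickness_for_fraction(alpha, fraction):
    """Layer thickness needed to absorb a given fraction of the radiation."""
    return -math.log(1.0 - fraction) / alpha


def bias_for_width(target_width, ref_width, ref_voltage):
    """Bias needed for a target width, using the width proportional to sqrt(V) scaling."""
    return ref_voltage * (target_width / ref_width) ** 2


ALPHA = 1e5  # absorption coefficient of Si at 0.8 um quoted in the example, m^-1

x_n = depletion_width_n(voltage=100.0, donor_density=5e21)
fraction = absorbed_fraction(ALPHA, x_n)
width_80 = thickness_for_fraction(ALPHA, 0.8)
bias_20um = bias_for_width(20e-6, 5e-6, 100.0)

print(f"x_n = {x_n * 1e6:.2f} um, absorbed fraction = {fraction:.2f}")
print(f"thickness for 80% absorption = {width_80 * 1e6:.1f} um")
print(f"bias for a 20 um depletion layer = {bias_20um:.0f} V")
```

The exact thickness for 80% absorption comes out at about 16 µm; the example rounds this up to 20 µm, which does not change the conclusion that the required bias is far beyond any practical breakdown voltage.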






7.3.6.2 The p-i-n photodiode

A structure that results in a good long wavelength response with only relatively modest bias levels is the so-called p-i-n (or PIN) structure, illustrated in Fig. 7.29. Here the intrinsic (i) region has a high resistivity (i.e. low values of $N_a$ and $N_d$) so that only a few volts of reverse bias are needed to cause the depletion region to extend all the way through to the n region and thus provide a large sensitive volume. In practice, the bias is maintained at a considerably higher voltage than the minimum value and the intrinsic region then remains fully depleted of carriers even at high light levels. The depletion region width in a p-i-n structure is then practically independent of applied voltage and thus much better delineated than in a p-n structure where the depletion region width will vary appreciably with applied voltage. For this reason most simple photodiode structures are of the p-i-n rather than the p-n type.

FIG. 7.29 Electric field distribution within a p-i-n structure.

For efficient detection of photons we require that as many as possible are absorbed within the intrinsic region (assuming a p-i-n structure). If the thicknesses of the p and i regions are $w_p$ and $w_i$ respectively, and assuming a surface reflectance of R, then the fraction $F_i$ of the incident power that is absorbed within the i layer is given by

$$F_i = (1 - R)\{\exp(-\alpha w_p) - \exp[-\alpha(w_p + w_i)]\} \qquad (7.32)$$

Assuming that $w_p \ll \alpha^{-1}$, so that absorption within the surface p⁺ layer may be neglected, we may ensure that most of the incident radiation is absorbed in the i layer by requiring that $w_i > \alpha^{-1}$. To be more specific, if $w_i = 2\alpha^{-1}$ then some 86% of the radiation entering the device will be absorbed (there will also be reflection losses at the surface but these can be reduced by the incorporation of an antireflection coating). Silicon p-i-n photodiodes can achieve quantum efficiencies of up to 80% in the wavelength range 0.8-0.9 µm. A typical spectral response of a p-i-n silicon photodiode is shown in Fig. 7.30.
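Equation (7.32) lends itself to a quick numerical check of the 86% figure and of the penalty paid for a thick surface layer. In this minimal sketch the absorption coefficient is the silicon value near 0.8 µm quoted earlier, while the p-layer thicknesses and the reflectance are hypothetical.

```python
import math


def fraction_absorbed_in_i_layer(alpha, w_p, w_i, reflectance=0.0):
    """Fraction of incident power absorbed in the i layer (eq. 7.32)."""
    return (1.0 - reflectance) * (
        math.exp(-alpha * w_p) - math.exp(-alpha * (w_p + w_i))
    )


ALPHA = 1e5             # absorption coefficient, m^-1
W_I = 2.0 / ALPHA       # i-layer thickness of two absorption lengths, m

ideal = fraction_absorbed_in_i_layer(ALPHA, w_p=0.0, w_i=W_I)
thin_p = fraction_absorbed_in_i_layer(ALPHA, w_p=0.5e-6, w_i=W_I)
thick_p = fraction_absorbed_in_i_layer(ALPHA, w_p=5e-6, w_i=W_I)
uncoated = fraction_absorbed_in_i_layer(ALPHA, w_p=0.5e-6, w_i=W_I, reflectance=0.3)

print(f"negligible p layer: {ideal:.3f}")
print(f"0.5 um p layer:     {thin_p:.3f}")
print(f"5 um p layer:       {thick_p:.3f}")
print(f"0.5 um p layer, R = 0.3 (hypothetical uncoated surface): {uncoated:.3f}")
```

The result for a negligible p layer reproduces the 86% quoted above, and the other cases show why both a thin p⁺ layer and an antireflection coating matter.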

The problem of low detector efficiencies at wavelengths close to the bandgap wavelength 
can also be addressed by using detectors that are illuminated from the side (i.e. parallel to 
the junction) although these are not commonly available. 



7.3.6.3 Photodiode materials 

For the detection of radiation in the visible and near-IR region one of the most popular photodiode materials is silicon. This has an energy gap of 1.14 eV and, as mentioned above, can achieve quantum efficiencies of up to 80% between 0.8 µm and 0.9 µm. An examination of Fig. 7.18 shows that at these wavelengths the absorption coefficient is about $10^5$ m⁻¹; thus to obtain efficiencies as high as 80% we require an intrinsic layer thickness of some 20 µm (that is, about $2\alpha^{-1}$) and also a surface p⁺ layer thickness that is substantially less than $\alpha^{-1}$.

As we shall see in Chapter 9, the development of optical fiber communication systems has led to the demand for detectors operating at longer wavelengths which have high sensitivities and exceptionally wide bandwidths. Photodiodes made from germanium are available (the bandgap wavelength for Ge is 1.88 µm) but they suffer from relatively low sensitivities and large reverse bias leakage currents, this latter being a general characteristic of small bandgap semiconductors. Much more successful have been materials that can be grown on InP substrates such as the ternary compound $\mathrm{In}_x\mathrm{Ga}_{1-x}\mathrm{As}$ and the quaternary compound GaInAsP. The materials must be lattice matched to the InP, which in the case of $\mathrm{In}_x\mathrm{Ga}_{1-x}\mathrm{As}$ takes place when x = 0.53. The bandgap is then 0.74 eV, which corresponds to a bandgap wavelength of 1.68 µm. Unfortunately when homojunctions are made from small bandgap materials they display relatively low breakdown voltages and large reverse leakage currents. In fact by taking eq. (2.51a) and using the expression for the intrinsic carrier concentration (eq. 2.36), it can be shown that $i_0 \propto \exp(-E_g/kT)$. Consequently, instead of homojunctions, it is usual to form heterojunctions with larger bandgap materials. An example of such a structure is shown in Fig. 7.31, where an i layer of $\mathrm{In}_{0.53}\mathrm{Ga}_{0.47}\mathrm{As}$ is sandwiched between a p⁺ layer of InP and an n layer of InP. Because the radiation has to pass through a layer of InP, which has a bandgap wavelength of 0.92 µm, no radiation below this wavelength will be detected. A further advantage of this structure is that since no carriers of interest will be generated in either the surface p⁺ layer or the lower n layer there will be no diffusion of carriers to the junction from outside the depletion region, which, as we shall see in the next section, can worsen the response time of the detector.

FIG. 7.31 Basic structure of a p-i-n heterojunction InGaAs detector.

From Fig. 7.18 we can see that the absorption coefficient for $\mathrm{In}_{0.53}\mathrm{Ga}_{0.47}\mathrm{As}$ close to the bandgap wavelength is higher than in the case of silicon. This is essentially because $\mathrm{In}_{0.53}\mathrm{Ga}_{0.47}\mathrm{As}$ is a direct bandgap semiconductor whereas silicon is an indirect bandgap semiconductor. As a consequence the intrinsic layer thickness can be much smaller than in the case of silicon. At 1.5 µm, for example, the absorption coefficient of $\mathrm{In}_{0.53}\mathrm{Ga}_{0.47}\mathrm{As}$ is about $10^6$ m⁻¹ and thus to achieve the condition $w_i = 2\alpha^{-1}$ we require an intrinsic layer thickness of the order of only 2 µm.



7.3.6.4 Response time of photodiodes

There are two main factors which limit the speed of response of photodiodes: these arise 
from the finite transit time of carriers across the depletion region and from the RC time con- 
stant associated with the electrical parameters of the diode and its external circuitry. 

DRIFT TIME OF CARRIERS THROUGH THE DEPLETION REGION 

In very high electric fields the drift velocities of carriers in semiconductors tend to saturate. Thus provided the field within the depletion region exceeds the saturation value for most of its length, then we may assume that the carriers move with the constant saturation velocity $v_\mathrm{sat}$. Consequently they will take a time $\tau_\mathrm{drift}$ to traverse the depletion region of width w, where

$$\tau_\mathrm{drift} = w/v_\mathrm{sat} \qquad (7.33)$$

During the whole of the time that the carriers are moving across the depletion region a current must be flowing through the external circuit. Thus if a very short duration pulse of radiation is absorbed within the depletion region then an output signal extending over a time $\tau_\mathrm{drift}$ will be observed. Typical values for the saturation velocities are of the order of $10^5$ m s⁻¹ and these are attained in fields of above about $10^6$ V m⁻¹.




JUNCTION CAPACITANCE EFFECTS

In section 2.8.4 it was shown that the capacitance of an abrupt junction can be written

$$C_\mathrm{j} = \frac{\varepsilon_0\varepsilon_r A}{w} \qquad (7.34)$$

where w is the total depletion region width. For a p-i-n photodiode we may assume a similar expression where w is now the width of the i region. Although junctions are rarely abrupt in practice, it still remains true that junction capacitance decreases with increasing reverse bias. For example, in a linearly graded junction we have $C_\mathrm{j} \propto V^{-1/3}$.

We see from Fig. 7.26 that at high frequencies the diode capacitance acts as a shunt across the output resistance network and reduces the output. Usually $R_\mathrm{sh} \gg R_L$ and $R_\mathrm{s} \ll R_L$, so that as far as the diode current $i_\lambda$ is concerned, the diode capacitance $C_\mathrm{j}$ is in parallel with the load resistor $R_L$. It is easy to show (see the problems at the end of the chapter) that if the current is modulated at frequency f then the output voltage of the device is given by

$$V_\mathrm{ext}(f) = \frac{i_\lambda R_L}{(1 + 4\pi^2 f^2 C_\mathrm{j}^2 R_L^2)^{1/2}}$$

The electrical bandwidth Δf is defined as the frequency range over which the output is above $1/\sqrt{2}$ of its low frequency value. Thus in the present instance we have

$$\Delta f = \frac{1}{2\pi R_L C_\mathrm{j}} \qquad (7.35)$$

The bandwidth may obviously be improved by reducing $C_j$. Inspection of eq. (7.34) shows
that this may be achieved by decreasing the diode area and increasing the thickness of the
i region. Unfortunately both these courses of action have associated difficulties. There is
obviously a limit on how small the diode area can be made without encountering problems
associated with focusing an incident beam onto a small area. In addition increasing the width of
the i layer will increase the value of $\tau_{\mathrm{drift}}$, which will worsen the frequency response as far
as carrier transit time is concerned. There will thus be an optimum value for the thickness
of the i region which will result in maximum overall bandwidth (or, equivalently, a minimum
in the overall response time).

Now the response time associated with the detector RC network can be written $\tau_{RC} = R_L C_j$.
The overall response time $\tau$ can then be written

$\tau^2 = \tau_{\mathrm{drift}}^2 + \tau_{RC}^2$ (7.36)

Now $\tau_{\mathrm{drift}} \propto w$ and $\tau_{RC} \propto 1/w$, and it is easy to show (Problem 7.16) that $\tau$ is minimized
when $\tau_{\mathrm{drift}} = \tau_{RC}$. This is the usual design criterion for ensuring the fastest diode response.

EXAMPLE 7.7 Response time of an InGaAs p-i-n photodiode

We consider a p-i-n photodiode based on InGaAs (i.e. as in Fig. 7.31) where the i
region has a thickness of 2 µm and whose area is 100 µm × 100 µm. The load resistor
used is 50 Ω. We take the saturation velocity of electrons in InGaAs to be $10^5$ m s$^{-1}$
and the relative permittivity of InGaAs to be 12.

The transit time of electrons through the depletion region ($\tau_{\mathrm{drift}}$) is given by
$2 \times 10^{-6}/10^5$ s, or $2 \times 10^{-11}$ s.

From eq. (7.34), the device capacitance is given by

$C_j = \dfrac{(100 \times 10^{-6})^2 \times 8.85 \times 10^{-12} \times 12}{2 \times 10^{-6}} = 5.3 \times 10^{-13}$ F

The value of $\tau_{RC}$ is then $R_L C_j$, or $50 \times 5.3 \times 10^{-13} = 2.65 \times 10^{-11}$ s.

The total response time for the detector $\tau$ may then be obtained from eq. (7.36):

$\tau = [(2 \times 10^{-11})^2 + (2.65 \times 10^{-11})^2]^{1/2} = 3.32 \times 10^{-11}$ s
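
The arithmetic of this example, together with the design criterion that the drift and RC times
should be equal, can be sketched in Python as follows. This is only an illustrative calculation
under the same simplifications as the text (saturated drift velocity, parallel-plate capacitance);
the function names are hypothetical, and the optimum-width expression is obtained here by setting
$w/v_{\mathrm{sat}} = R_L \varepsilon_0 \varepsilon_r A / w$.

```python
import math

EPS0 = 8.854e-12  # permittivity of free space, F/m


def pin_response(width_m, area_m2, eps_r, v_sat, r_load):
    """Return (tau_drift, C_j, tau_RC, tau_total) for a p-i-n photodiode."""
    tau_drift = width_m / v_sat                   # transit time, eq. (7.33)
    c_j = EPS0 * eps_r * area_m2 / width_m        # junction capacitance, eq. (7.34)
    tau_rc = r_load * c_j                         # RC time constant
    tau_total = math.hypot(tau_drift, tau_rc)     # quadrature sum, eq. (7.36)
    return tau_drift, c_j, tau_rc, tau_total


def optimum_width(area_m2, eps_r, v_sat, r_load):
    """i-region width for which tau_drift equals tau_RC."""
    return math.sqrt(r_load * EPS0 * eps_r * area_m2 * v_sat)


area = (100e-6) ** 2  # 100 um x 100 um, in m^2
tau_drift, c_j, tau_rc, tau = pin_response(2e-6, area, 12.0, 1e5, 50.0)
print(f"tau_drift = {tau_drift:.2e} s, C_j = {c_j:.2e} F")
print(f"tau_RC = {tau_rc:.2e} s, tau = {tau:.2e} s")

w_opt = optimum_width(area, 12.0, 1e5, 50.0)
tau_opt = pin_response(w_opt, area, 12.0, 1e5, 50.0)[3]
print(f"optimum width = {w_opt * 1e6:.2f} um, tau at optimum = {tau_opt:.2e} s")
```

For the values of this example the optimum width comes out slightly larger than the 2 µm assumed,
so the device considered is already close to the fastest design for its area and load.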



Although the above two effects are the main causes of finite response times in p-i-n
photodiodes, there are others. One that should be mentioned arises from carrier diffusion:
carriers generated outside the high field regions may contribute to the output provided the
electron-hole pairs manage to diffuse into the high field region before recombination takes
place. The problem with this is that carrier diffusion is inherently a relatively slow process
and thus can give rise to a delayed response by the photodiode. The time taken for excess
carriers to diffuse a distance $d$ may be written (ref. 7.7)

$\tau_{\mathrm{diff}} = d^2 / (2 D_c)$ (7.37)

where $D_c$ is the minority carrier diffusion coefficient. Thus distances of the order of micrometres
typically involve carrier diffusion times of the order of nanoseconds (Example 7.8) and so
carriers generated outside the depletion region may cause a 'slow' tail to be present in
the response as illustrated in Fig. 7.32. Normally in a p-i-n photodiode the number of such
carriers is sufficiently small that they cause few problems.

EXAMPLE 7.8 Diffusion time of carriers in Si

We consider the time taken for electrons to diffuse through a layer of p-type silicon
5 µm thick. Taking $D_c = 3.4 \times 10^{-3}$ m$^2$ s$^{-1}$, we obtain from eq. (7.37) that

$\tau_{\mathrm{diff}} = \dfrac{(5 \times 10^{-6})^2}{2 \times 3.4 \times 10^{-3}} = 3.7 \times 10^{-9}$ s
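
To see why diffusion gives a 'slow' tail, the diffusion time above can be compared with the time a
carrier would need to drift the same distance at the saturation velocity. The Python sketch below
does this; the saturation velocity of $10^5$ m s$^{-1}$ is taken as a typical value and is an assumption
for this comparison, as are the function names.

```python
def diffusion_time(distance_m: float, diffusion_coeff: float) -> float:
    """Time to diffuse a distance d, eq. (7.37): d^2 / (2 D)."""
    return distance_m ** 2 / (2.0 * diffusion_coeff)


def drift_time(distance_m: float, velocity: float) -> float:
    """Time to drift a distance d at constant velocity."""
    return distance_m / velocity


d = 5e-6                                  # layer thickness, m
tau_diff = diffusion_time(d, 3.4e-3)      # D in m^2/s, as in Example 7.8
tau_drift = drift_time(d, 1e5)            # assumed saturation velocity, m/s

print(f"diffusion: {tau_diff:.2e} s, drift: {tau_drift:.2e} s")
print(f"diffusion is slower by a factor of about {tau_diff / tau_drift:.0f}")
```

Because the diffusion time grows as $d^2$ while the drift time grows only as $d$, the disparity
becomes larger for carriers generated further from the depletion region.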



7.3.6.5 Schottky photodiodes 

In a Schottky photodiode a metal-semiconductor junction (section 2.8.6.2) takes the place
of the p-n junction of the junction photodiode. The main advantage of the Schottky photodiode
is at wavelengths appreciably shorter than the bandgap wavelength $\lambda_g$. Here, in a normal p-n
junction diode, heavy absorption in the semiconductor surface layer gives rise to a much reduced
quantum efficiency. In the Schottky diode the surface antireflection coating-metal surface layer
can be much more highly transmitting at these wavelengths, leading to higher efficiencies. However,
the devices are relatively difficult to fabricate (great care is needed, for example, in obtaining
the correct thicknesses of the antireflection coating and metal layer) and offer no significant
advantages over p-n junction devices at wavelengths closer to the bandgap wavelength.

Schottky photodiodes are available using a variety of semiconductors such as Si, GaAs, GaP
and GaAsP and are mainly used for their relatively high blue and UV sensitivity. It is of
interest to note that the device can respond to photons that have a lower energy than that of the
semiconductor bandgap. An inspection of Fig. 7.33(b) shows that photons which are absorbed
in the metal and which have energies as low as $e(\phi - \chi)$ can cause electrons to surmount the
metal-semiconductor barrier by thermionic emission and contribute to the photocurrent.

Photodetectors have also been made using metal-semiconductor-metal (MSM) junctions.
These can be made very simply by forming two interdigitated metal contacts on top of a fairly
thin (say 3 µm) undoped semiconductor layer (Fig. 7.34). The spacing and area of the contacts
are such that at fairly modest bias voltages the semiconductor layer under the electrodes
is almost completely depleted whilst ensuring that the light-sensitive area presented to
incoming radiation is between 50% and 75% of the total area. The energy band diagram is shown
in Fig. 7.35. The dark current is mainly determined by thermionic emission of both holes
and electrons over the respective barriers and is comparable with that in p-i-n photodiodes
(i.e. 1 nA or lower). It is interesting to note that the device capacitance is approximately four
times smaller than that of a p-i-n junction of comparable light-sensitive area (only half the
spacing between the electrodes is filled with dielectric, the other half is filled with air). This
low capacitance is obviously very helpful in achieving a high speed of response. The spacing
between the metal contacts can be made smaller than for a vertical diode, giving shorter carrier
transit times and again increasing response speeds. Bandwidths of 20-50 GHz are possible.

FIG. 7.34 The interdigitated electrode structure used for MSM photodiodes.

FIG. 7.35 Energy band diagram of a metal-semiconductor-metal junction under bias.

Such devices have been successfully made using GaAs as the semiconductor. As far as
In$_{0.53}$Ga$_{0.47}$As is concerned there is a slight problem in that the Schottky barrier height is
small, which results in a high dark current. This may be reduced by interposing an
InAlAs surface layer between the contacts and the semiconductor, which enhances the barrier
height. The devices hold considerable promise as they are relatively easy to fabricate and also
to integrate with amplifier structures (see section 9.4).



7.3.6.6 Avalanche photodiodes

As explained above, most fast photodiodes are designed for use with a 50 Ω load impedance
and the voltage output ($= i_\lambda R_L$) often requires considerable amplification. Internal
amplification of the photocurrent is achieved in the avalanche photodiode (APD). In this
device a basic p-n structure is operated under very high reverse bias. Carriers traversing
the depletion region therefore gain sufficient energy to enable further carriers to be excited
across the energy gap by impact excitation. The process is illustrated in Fig. 7.36.

An electron having reached point A on the diagram has enough energy above the
conduction band bottom such that it can collide with an electron from the valence band and raise
it to the conduction band (C → D) (the minimum energy required to initiate this process was
discussed in section 4.3 when dealing with cathodoluminescence). This generates a new
electron-hole pair: in so doing, the electron will of course lose an equivalent amount of energy
and move from A to B. The newly generated carriers may both subsequently generate further
electron-hole pairs by the same process. The probability that an ionizing event takes place
is governed by the carrier densities and also the ionization coefficients. The latter vary rapidly
with electric field $\mathcal{E}$ according to a relation of the form $\exp(-A/\mathcal{E})$ where $A$ is a constant. In
addition it should be noted that the magnitudes of the ionization coefficients can differ
depending on the type of carrier. In silicon, for example, the ionization coefficient for
electrons is considerably greater than that for holes, whilst in germanium the two are almost
identical.

Current gains in excess of 100 are readily achievable. However, as shown in Fig. 7.37, the 
current gain is very sensitive to the value of the bias voltage, and if the bias voltage is made 
too large there is the danger of creating a self-sustaining avalanche current that flows in the 
absence of any photoexcitation, which sets an upper limit on the voltage that may be used. In 
fact non-uniformities within the device can cause small regions of premature breakdown known 
as microplasmas. Obviously the device requires a very stable voltage supply and, in addition,
since the avalanche process itself depends on temperature, temperature stabilization is required. 
Another problem that can occur is that of excessive leakage currents at the junction edges. In 
silicon APDs this latter problem can be addressed by the use of a guard ring as shown in Fig.
7.38. This also serves to restrict the avalanche region to the central illuminated part of the cell 
and thus helps to reduce premature breakdown. Unfortunately the guard ring also increases 
the capacitance of the device, thus restricting the high frequency performance.

FIG. 7.37 Typical variation of current gain with reverse bias voltage for an avalanche photodiode.

FIG. 7.38 Silicon avalanche photodetector with guard-ring structure. The guard ring is a region of
comparatively low doping, and hence the depletion region extends an appreciable distance into it. In
the vicinity of the guard ring the total depletion layer thickness is greater, and hence the maximum
field strengths are less, than in the central region.

Another type of avalanche device is the reach-through avalanche photodiode (RAPD).
This has a p$^+$πpn$^+$ structure, as shown in Fig. 7.39(a) (π indicates lightly doped p material).
When a sufficiently high external voltage is applied the field distribution is as shown in
Fig. 7.39(b). The field extends all the way through (i.e. it reaches through) the π region to
the pn$^+$ junction where the fields are high enough to give rise to avalanche gain. In operation
incident radiation is absorbed in the relatively wide π region. The field then separates
the charge carriers with the electrons moving to the avalanche region where they take part
in the avalanche process as described above. Since electrons have a much larger ionization
coefficient than holes in silicon, the fact that only electrons initiate the avalanche process
has little bearing on the overall current gains achieved. In addition, because the applied
voltage is 'shared' between the (wide) π region and the avalanche region the gain becomes
relatively insensitive to variations in applied voltage.

FIG. 7.39 (a) Structure of a silicon RAPD device. (b) The field distribution across the RAPD; the high
field gain region is across the pn$^+$ junction.

As in the case of photodiodes, the extension of the range of APDs up to a wavelength
of 1.6 µm requires the use of semiconductors with smaller bandgaps than silicon. One
immediate problem with such materials is that dark currents will be larger and although
cooling can help, this obviously adds to the complexity of the detector. Germanium has
been used reasonably successfully, but the fact that the electron and hole ionization
coefficients are almost equal gives rise to relatively large amounts of noise arising from
variations in the avalanche gain process itself (see the comments at the end of this section).
As with junction diodes considerable effort has been put into developing InP/InGaAs
devices. The problem of increasing dark currents with decreasing bandgaps has been solved
by separating the absorption and gain regions (as in the RAPD) with the absorption occurring
in the narrow bandgap material In$_{0.53}$Ga$_{0.47}$As whilst the avalanche gain occurs in the
wider bandgap material InP. This structure is called the SAM (Separate Absorption and
Multiplication) APD and is illustrated in Fig. 7.40.

NOISE CONSIDERATIONS 

As well as the usual sources of noise discussed previously another factor appears with the
avalanche photodiode. The exact number of carrier multiplication events produced by a single
charge carrier as it moves through the high field region will be subject to statistical
variation (because the distance travelled before an ionizing event occurs will be subject to
statistical variation). If there were no such fluctuation in the gain process then the shot noise
contribution would simply be multiplied by the gain $M$. However, because of the variation
in $M$ we may write

$\Delta i_s = M[2F(M)(i_\lambda + i_D)e\Delta f]^{1/2}$ (7.38)

where $i_\lambda$ is the photogenerated current, $i_D$ the dark current and $F(M)$ a function called the
excess noise factor.

FIG. 7.40 Basic structure of a SAM APD device based on InP. Note that in this example the radiation is
incident through a window that has been etched into the substrate.

Where electrons alone are injected into the high field region it has been
shown (ref. 7.8) that $F(M)$ can be written as

$F(M) = M\left[1 - \left(1 - \dfrac{1}{r}\right)\left(\dfrac{M-1}{M}\right)^2\right]$ (7.38a)

where $r$ is the ratio of electron to hole ionization coefficients (for the case of hole injection,
$r$ is replaced by $1/r$). An approximation often used for $F(M)$ is $M^x$, with $x$ taking a value
between 0 and 1 depending on the material.

An inspection of eq. (7.38a) shows that when $r \to \infty$ then $F(M) \to 2 - 1/M$, whilst when
$r = 1$, then $F(M) = M$. It is thus advantageous from a noise point of view to use materials
where the ionization probabilities of the two types of carrier are very different from each
other.
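
The dependence of the excess noise factor on the ionization coefficient ratio can be explored
numerically. The Python sketch below assumes the electron-injection form of eq. (7.38a),
$F(M) = M[1 - (1 - 1/r)((M - 1)/M)^2]$, and the multiplied shot noise of eq. (7.38); the function
names and the sample gain of 100 are illustrative choices, not values from the text.

```python
import math

E_CHARGE = 1.602e-19  # electronic charge, C


def excess_noise_factor(gain: float, r: float) -> float:
    """F(M) for electron injection; r is the electron/hole ionization coefficient ratio."""
    return gain * (1.0 - (1.0 - 1.0 / r) * ((gain - 1.0) / gain) ** 2)


def apd_shot_noise(gain, r, i_photo, i_dark, bandwidth):
    """RMS multiplied shot noise current, eq. (7.38)."""
    f = excess_noise_factor(gain, r)
    return gain * math.sqrt(2.0 * f * (i_photo + i_dark) * E_CHARGE * bandwidth)


M = 100.0
for r in (1.0, 10.0, math.inf):
    print(f"r = {r}: F(M) = {excess_noise_factor(M, r):.3f}")
```

The two limiting cases reproduce the behaviour noted above: equal ionization coefficients give
$F(M) = M$, while a very large ratio gives a value just below 2.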

7.3.6.7 Phototransistors

The phototransistor is another device, like the avalanche photodiode, where the current flow
from a p-n junction detector is internally amplified. The construction is basically that of a
junction transistor, with the base region exposed to the incident radiation. Normally no external
connection is made to the base (see Fig. 7.41a). To understand the operation of the device
we consider the external currents to be as shown in Fig. 7.41(b). The base current $i_b$ will be
supplied by the photogenerated current.

FIG. 7.41 External connections made to an n-p-n phototransistor (a). Light absorbed within the base
region causes an emitter current to flow through the load resistor $R_L$ and thus a signal voltage will
appear across it. (b) The currents assumed to be flowing in the phototransistor.

We must have

$i_c = i_e - i_b$

where $i_c$ and $i_e$ are the collector and emitter currents respectively. The collector current has
two components: (a) the normal diode reverse saturation current $i_{co}$, and (b) that part of the
emitter current that manages to cross to the collector. (The current is carried by minority
carrier diffusion across the base, and not all the minority carriers leaving the emitter will
reach the collector.) We write this latter current as $\alpha i_e$, where $\alpha$ is slightly less than unity
($\alpha$ is known as the common base current gain). Thus

$i_{co} + \alpha i_e = i_e - i_b$

whence

$i_e = \dfrac{i_b + i_{co}}{1 - \alpha} = (i_b + i_{co})(1 + h_{fe})$

where $h_{fe} = \alpha/(1 - \alpha)$ is known as the common emitter current gain of the transistor. Typical
dark values for $h_{fe}$ in phototransistors are about 100. With no incident radiation $i_b = 0$ and
the current flowing, $i_{co}(h_{fe} + 1)$, is the dark current of the device. This is obviously larger
than for comparable p-n junction devices, where the dark current in this notation is just $i_{co}$.

When illuminated there will be a base current of magnitude $i_\lambda$, where, from eq. (7.25),
$i_\lambda = \eta(I_0 A e \lambda_0 / hc)$. The external current flowing is now $(i_\lambda + i_{co})(1 + h_{fe})$ which, if
$i_\lambda \gg i_{co}$, is equal to $i_\lambda(1 + h_{fe})$. Thus the device gives us internal gain, and has a
responsivity lying between that of a p-i-n photodiode and an avalanche photodiode.
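
The gain relations for the phototransistor can be illustrated with a few lines of Python. The
sketch assumes the relations $h_{fe} = \alpha/(1 - \alpha)$ and external current $(i_b + i_{co})(1 + h_{fe})$; the
numerical values of $\alpha$, the saturation current and the photocurrent are arbitrary assumptions
chosen only to show the orders of magnitude involved.

```python
def common_emitter_gain(alpha: float) -> float:
    """h_fe = alpha / (1 - alpha), with alpha the common base current gain."""
    return alpha / (1.0 - alpha)


def emitter_current(i_base: float, i_co: float, alpha: float) -> float:
    """External current (i_b + i_co)(1 + h_fe) flowing through the device."""
    return (i_base + i_co) * (1.0 + common_emitter_gain(alpha))


alpha = 0.99      # assumed common base gain, giving h_fe close to 100
i_co = 1e-9       # assumed reverse saturation current, A
i_photo = 1e-6    # assumed photogenerated base current, A

dark = emitter_current(0.0, i_co, alpha)
lit = emitter_current(i_photo, i_co, alpha)
print(f"h_fe = {common_emitter_gain(alpha):.1f}")
print(f"dark current = {dark:.2e} A, illuminated current = {lit:.2e} A")
```

Both the photocurrent and the saturation current are multiplied by the same factor $(1 + h_{fe})$, which
is why the internal gain is accompanied by a correspondingly larger dark current.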

Silicon-based phototransistors are readily available at low cost; a typical device structure 
is shown in Fig. 7.42. Such detectors usually suffer from a poor frequency bandwidth, often 
being limited to a few hundred kilohertz. This arises both from the high capacitance of the 
base-collector junction and the long carrier transit times across the base region. However, 
the presence of internal gain can greatly simplify detection circuitry where the small bandwidth
is not a problem (e.g. in remote control devices for TVs and videos).

FIG. 7.42 Structure of a silicon phototransistor.

It is possible to make phototransistors with much higher frequency bandwidths. Figure 7.43,
for example, shows a structure for an n-p-n phototransistor based on an InGaAsP/InP
heterojunction. As in the InGaAs/InP p-i-n photodiode of Fig. 7.31, incident radiation with a
wavelength greater than 0.92 µm will pass unattenuated through the upper InP layer and
wavelengths up to the bandgap wavelength of the base material will then be absorbed in the base.
FIG. 7.43 Structure of an n-p-n phototransistor based on an InGaAsP/InP heterojunction.

7.3.6.8 Intensified photodiodes

A recently developed device that is a hybrid between a photomultiplier and a photodiode
is the intensified photodiode. In this device, which is illustrated in Fig. 7.44, the incoming
radiation strikes a photocathode which is situated on the inside of a vacuum tube as in a
photomultiplier. The emitted photoelectrons are then accelerated down the tube (to an energy
of about 8 keV) and focused onto a small area semiconductor p-i-n junction or Schottky
diode. As the high energy electrons travel through the semiconductor material each
generates several thousand secondary electrons, thereby producing a current gain. Commonly
used photocathode materials include GaAs and GaAsP which can cover the wavelength
range from 340 nm to 830 nm.

In terms of performance their D* values can be appreciably larger than those of comparable
photomultipliers or avalanche photodiodes. One drawback is that they require a relatively high
operating voltage (8-15 kV). Usually the intensified photodiode is supplied with a compact
high voltage supply operating from a 12 V supply.

7.3.7 Detector arrays 

In section 7.3.5.3 we have seen how an optical image may be converted into an electrical
signal by using a vidicon-type device. An alternative approach is to use an array of discrete
detectors. The image may be focused directly onto a two-dimensional array, or alternatively
the image may be scanned past a one-dimensional array. The problem with either of these
more direct approaches is that obtaining a scan of the image requires the output of each
detector to be read out sequentially. Given that an array may contain many thousands of
detectors, the problem of correctly interconnecting them is not trivial. Although nowadays
this is not a real problem, it was in the 1970s and so an alternative approach was proposed
by researchers at Bell Laboratories. This involved using a charge-coupled device (CCD)
structure. The basic building block of this is the metal-oxide-semiconductor (MOS) capacitor.
This is formed by growing a layer of silicon dioxide (SiO$_2$) on a p-type silicon substrate;
a metal electrode is then evaporated on top of the oxide layer (Fig. 7.45). The metal
electrode is known as the gate and is biased positively with respect to the silicon.
Photogenerated electron-hole pairs within the silicon will be separated, with the electrons
being attracted to the surface of the silicon under the gate, where they will remain whilst
the gate voltage is positive. The electrons are effectively trapped within a potential well
formed under the gate contact. The amount of trapped charge will be proportional to the total
integrated light flux falling onto the device during the measurement period.

FIG. 7.45 MOS capacitor structure. When the gate is biased positively, photogenerated electron-hole
pairs become separated and the electrons then become trapped at the SiO$_2$-Si boundary beneath the
gate electrode.

The problem now is to 'read out' this charge sequentially along a line of such detectors.
This can be done by passing the charge from detector to detector. There are several ways
of achieving this, the basic idea being illustrated in Fig. 7.46. The gate potentials are
supplied from three voltage lines (L$_1$, L$_2$, L$_3$) each one being connected to every third electrode
(G$_1$, G$_2$, G$_3$). We suppose that initially the potential of L$_1$ is at some positive value $V_g$ whilst
L$_2$ and L$_3$ are at zero potential. Photogenerated charge will be trapped under the G$_1$
electrodes in proportion to the amount of light falling on these elements (Fig. 7.46b). After a
suitable integration time, the charge may be moved along the chain of MOS capacitors by
applying a repeated sequence of potentials to the gate supply lines. Thus, suppose we apply
a voltage $V_g$ to L$_2$ whilst maintaining L$_1$ at $V_g$: the charge initially under G$_1$ will now be shared
between G$_1$ and G$_2$ (Fig. 7.46c). Next we reduce the potential of L$_1$ to zero. All the charge
that was initially under G$_1$ is then under G$_2$ (Fig. 7.46d). Continuing this cycle will
progressively move the charge along the line of MOS capacitors from left to right. At the end
of the line, the amount of charge arriving as a function of time then provides a sequential
scan of the G$_1$ detector outputs.
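
The transfer sequence can be mimicked with a very simple Python model. The sketch below is an
idealization introduced here for illustration: each detector is represented by three gates, the
'share then empty' pair of voltage changes is collapsed into a single lossless shift of one gate
to the right, and charge leaving the last gate is counted as output. The function names are
hypothetical.

```python
def shift_one_gate(wells):
    """Move every charge packet one gate to the right; return (new_wells, charge_out)."""
    charge_out = wells[-1]
    return [0.0] + wells[:-1], charge_out


def read_out(pixel_charges):
    """Clock a line of three-gate cells until empty; return charges in order of arrival."""
    wells = []
    for q in pixel_charges:
        wells.extend([q, 0.0, 0.0])   # charge collects under G1; G2 and G3 start empty
    output = []
    for _ in range(len(pixel_charges)):
        arrived = 0.0
        for _phase in range(3):       # three gate-to-gate moves shift a packet by one cell
            wells, out = shift_one_gate(wells)
            arrived += out
        output.append(arrived)
    return output


print(read_out([1.0, 2.0, 3.0]))
```

In this model the packet nearest the output end of the line arrives first, so the readout is a
time sequence of the detector charges taken from right to left.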
For obvious reasons, this is known as a three-phase scheme. In practice several other
schemes are possible and ref. 7.9 may be consulted for more details. One problem with the
device as described here is that a new light scan cannot be carried out until all of the charge
has been transferred along the array. A faster scanning rate may be achieved with the layout
indicated in Fig. 7.47. Here, a second CCD array (the transport register), which is shielded
from incident radiation, lies alongside the first. Once a charge image has been built up in
the sensing array it is then transferred 'sideways' to the transport register and can be read
out sequentially along the transport register. This readout takes place at the same time as a
new image is being built up.

FIG. 7.46 Basic CCD array composed of a line of MOS capacitors (a). The initial charge distribution
within the potential wells when G$_1$ has a positive voltage applied ($= V_g$) and G$_2$ = G$_3$ = 0 is
shown in (b). In (c) G$_1$ = G$_2$ = $V_g$, G$_3$ = 0 and the charge has now spread out under both G$_1$ and
G$_2$. In (d) G$_2$ = $V_g$ and G$_1$ = G$_3$ = 0. The charge that initially was under G$_1$ has now moved to
the right to be under G$_2$. Note that no charge should be generated under the gates G$_2$ and G$_3$;
these elements are therefore screened from incident radiation.

Two-dimensional arrays based on the above one-dimensional designs are also possible,
and are known as interline transfer devices. Here the transfer registers feed into a readout
register running down the edge of the device (Fig. 7.48a). The contents of each line are read
out in sequence into the readout register so that the signal appearing at the end of the
register represents a line-by-line scan of the image. A problem with this scheme is that the
presence of the transfer register reduces the resolution possible by increasing the distance
between the sensing arrays. It is possible to dispense with the transfer register altogether by
adopting the design shown in Fig. 7.48(b). Here, each array is made twice as long as before,
with the second half of the array being shielded from the incident radiation. Then, after a
charge image has been built up it is moved along each array into the shielded section where
it can be stored until it can be transferred into the readout register as before. Again, readout
takes place whilst a new image is being built up. This scheme is known as frame transfer.
Although it offers better resolution than interline transfer, it is also somewhat slower because
of the relatively long time taken to transfer charge along the sensing array and into the storage
section.

FIG. 7.47 Basic arrangement of the detector array and transport register in a linear CCD optical sensing
array. The directions of charge transport are indicated by arrows and shading indicates those areas that are
shielded from incident radiation.

It will be evident that whichever of these schemes is adopted the photogenerated charge
has to undergo a large number of transfers before being 'read off'. It is absolutely essential,
therefore, that as little charge as possible gets 'lost' on the way. The fraction of the total charge
which is successfully moved in each transfer is called the charge transfer efficiency ($\eta_{ct}$).
Thus after $m$ transfers the original charge will be reduced by a factor $(\eta_{ct})^m$. For the readout
signal to provide a faithful representation of the pixel signal it is essential that $\eta_{ct}$ is very
close to unity. Thus consider a 1024 × 1024 CCD array which operates with $\eta_{ct} = 0.9999$ (i.e.
1 electron lost in 10 000). The maximum number of charge transfers will be 3 × 2048 (assuming
a three-phase scheme) and the fraction of the original charge that is transferred will be
$0.9999^{6144}$ or 0.54. In current CCDs values for $\eta_{ct}$ can reach 0.999 999. Such values have
come about as a result of the virtual elimination of crystal defects in the materials used. When
CCD devices are used in an environment such as space where there is a strong possibility
of damage due to radiation bombardment then their performance can be severely limited.
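
The sensitivity of the retained charge to the transfer efficiency is easily checked numerically.
The Python sketch below evaluates the factor $(\eta_{ct})^m$ for the worst-case pixel of a
1024 × 1024 three-phase array; the function name is an illustrative choice.

```python
def fraction_retained(cte: float, n_transfers: int) -> float:
    """Fraction of the original charge left after n_transfers, each of efficiency cte."""
    return cte ** n_transfers


n_max = 3 * 2048  # worst-case number of transfers for a 1024 x 1024 three-phase array

for cte in (0.9999, 0.99999, 0.999999):
    print(f"cte = {cte}: fraction retained = {fraction_retained(cte, n_max):.4f}")
```

With an efficiency of 0.9999 only about half of the charge from the furthest pixel survives,
whereas at 0.999 999 the loss falls to well under 1%.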

In modern designs the gates are made from polysilicon (Fig. 7.49) which is reasonably
transparent to radiation between 400 nm and 1100 nm. However, quantum efficiencies are
then limited to about 0.35. The wavelength range can be extended downwards towards the
UV by coating the front surface of the array with an appropriate phosphor material. Another
approach is to thin the substrate sufficiently to allow light to be incident from below. Much
higher quantum efficiencies (up to 0.9) are then possible and the short wavelength range is
extended down to 200 nm. This procedure is not without its problems: the thinning process
can cause the introduction of defects, and the very thin structure is difficult to support and
keep optically flat.

FIG. 7.49 Illustration of the polysilicon gate MOS structure in a CCD array; illumination may be either
through the gate or from the back.

Two-dimensional arrays typically contain 1024 × 1024 pixels, although arrays with larger
numbers are available. The pixel side lengths can vary between 7 µm and 50 µm. The rate
at which the images can be read out can vary greatly depending on the size of the array and
the application. For example, when arrays are used in camcorders, video rates (i.e. 30 images
per second) are required although image quality is not as high as TV standards. The smallest
light signals that can be detected depend on the number of 'dark' electrons; this is the
number of electrons that accumulate under the gates during the integration time which are
not photogenerated. At video rates of image transfer this can be about 10 electrons per pixel,
although lower values can be obtained especially on cooling. Thus it is possible to detect
light levels resulting in something like 40 photons per pixel. If yet higher sensitivities are
required then the CCD array may be placed after an image intensifier.

Arrays of other types of photodetector are also available, one of the most common being
based on the silicon photodiode. IR night vision devices can make use of detector arrays based
on, for example, GaAs/GaAlAs, InSb and HgCdTe. The two latter detectors need to be
cryogenically cooled. Another interesting development has been that of the microbolometer.
Using a combination of photolithography and selective etching it has been possible to
manufacture arrays of small silicon 'plates' 0.5 µm thick and of area 50 µm × 50 µm which are
supported clear of the underlying silicon on two 'legs'. Incident radiation causes the
temperature of the silicon slab to increase, and this temperature increase may then be detected
by any of a number of means, for example by measuring the resistance of the slab (see section
7.2). Such detectors can have comparable performance with the cooled arrays.

7.3.8 Liquid crystal light valves 

The liquid crystal light valve is an optical to optical image transducer that takes a low
irradiance light image and converts it into an output image which uses light from another source
(and which, in consequence, may be of a much higher irradiance, e.g. a laser beam). Although
not primarily used for radiation detection purposes the operation of the device is more
conveniently dealt with in this chapter than elsewhere. It was first developed by the Hughes
Aircraft Company in the 1970s and has undergone substantial development since. Figure
7.50 shows the basic construction of the first-generation light valve. The device consists of
several layers at the heart of which is a nematic liquid crystal cell. The 'read' beam first
traverses the liquid crystal layer, is reflected from a dielectric mirror, repasses through the
liquid crystal layer and then exits from the device. The exact workings of the liquid crystal
cell will be dealt with later since it operates in a slightly different mode from that described
in section 4.8. For the moment we assume that the liquid crystal cell-mirror combination
can be switched from a non-reflective mode to a reflective mode by the application of a
suitable voltage. When no writing light is present the cadmium sulfide photoconductive layer
has a very high resistance so that the potential, which is applied across the whole device,
is mainly dropped across this layer rather than the liquid crystal, and consequently the liquid
crystal cell-mirror combination is non-reflecting. When a writing beam is present radiation
is absorbed within the cadmium sulfide layer and its resistance falls; the potential across
the liquid crystal then increases, causing the 'read' beam to be reflected.

FIG. 7.50 Structure of a first-generation liquid crystal light valve.

If the liquid crystal cell were operated in the 'normal' way (e.g. as in Fig. 4.26) it would
be necessary to have a polarizer between the cell and the dielectric mirror, which has a high
resistance and thus would always have a considerable potential drop across it. In the hybrid
field effect mode the liquid crystal molecules have a twist angle of only 45° (in contrast to
the more usual 90°). The required polarizer and (crossed polaroid) analyzer functions are
combined at the 'read' input by using a polarization beam splitter (Fig. 7.51). Light from
the source is polarized as it is reflected into the device, whilst for the returning beam only
light with an orthogonal polarization will be transmitted straight through the beam splitter.
With no potential applied across the liquid crystal cell the polarized input 'read' radiation
undergoes a 45° twist as it passes through the cell; it is then reflected and 'untwists' on its
way back to give exactly the same state of polarization it had on entering. However, the
analyzer will only transmit the orthogonal polarization, and so the radiation is blocked. As soon
as a potential is applied to the cell, however, the molecules start to move away from their
uniform 45° twist directions. The radiation that is transmitted is then not completely linearly


FIG. 7.51 Use of a polarizing beam splitter with a liquid crystal light valve to achieve the
required polarizer and analyzer functions. (Diagram labels: write beam, light valve, polarizing
beam splitter, input read beam.)


polarized and some will be able to pass through the analyzer. With higher potentials the
molecules line up along the field and again the polarization is the same as the input, leading
to the radiation again being blocked. The response of the cell as a function of applied voltage
is shown in Fig. 7.52 (this characteristic is quite different from that of Fig. 4.27). Between
the photoconductive layer and the dielectric mirror is a light-blocking layer; this is present
to ensure that none of the 'reading' beam reaches the photoconductive layer.



The main problem with first-generation types is their relatively slow response time, which
arises from the presence of traps in the cadmium sulfide. In second-generation types the
cadmium sulfide layer is replaced by a silicon MOS structure, as illustrated in Fig. 7.53. The
device operates in a two-phase cycle. In the depletion (active) phase the applied voltage is
such that the silicon is depleted of carriers. The electron-hole pairs that are generated by the
input 'write' beam are separated, with the electrons moving to the Si/SiO2 interface. When the
charge arrives there, a grid of n-type 'microdiodes' serves to contain the charge within
well-defined areas. The depletion phase is followed by the accumulation phase, during which a
voltage of opposite polarity is applied and the charge held at the Si/SiO2 interface is removed.

One application is in large screen projection systems, where the read beam is supplied by a
powerful lamp. The device can also be used in image processing and optical data processing
systems.

FIG. 7.52 The transfer function of a liquid crystal cell operated in the hybrid field effect
mode with a 45° twist angle (reflectance plotted against applied voltage).

FIG. 7.53 Structure of a second-generation liquid crystal light valve.




7.3.9 Photon counting techniques 

A useful technique that is sometimes used when dealing with very low level signals is photon
counting. Usually the arrival of a single photon at a detector gives rise to a very small output
signal. However, if the detector exhibits considerable internal gain (as in the photomultiplier
and the avalanche photodiode) it may be possible to detect the arrival of a single photon
as an output voltage pulse. If optical power $P_\lambda$ at wavelength $\lambda_0$ is incident on
such a detector whose quantum efficiency is $\eta$, then we would expect to see a pulse rate
$R_p = \eta P_\lambda \lambda_0/(hc)$. Ideally the pulses should all have the same amplitude, but
the gain values will always exhibit a statistical fluctuation about a mean and consequently we
should expect the output pulse heights to be similarly distributed.
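
As a rough illustration of the pulse rate expression above, the following sketch evaluates $R_p = \eta P_\lambda \lambda_0/(hc)$. The function name and the example values (1 fW at 500 nm with a quantum efficiency of 0.2) are assumptions chosen for illustration and do not come from the text.

```python
H_PLANCK = 6.62607015e-34  # Planck constant, J s
C_LIGHT = 2.99792458e8     # speed of light in vacuum, m/s


def photon_pulse_rate(power_w, wavelength_m, quantum_efficiency):
    """Expected pulse rate R_p = eta * P * lambda_0 / (h c), in pulses per second."""
    photon_energy_j = H_PLANCK * C_LIGHT / wavelength_m
    return quantum_efficiency * power_w / photon_energy_j


if __name__ == "__main__":
    # Illustrative (assumed) values: 1 fW at 500 nm, quantum efficiency 0.2.
    rate = photon_pulse_rate(1e-15, 500e-9, 0.2)
    print(f"expected pulse rate: {rate:.0f} per second")
```

With these assumed values the rate comes out at roughly 500 pulses per second, which shows why individual pulses can be counted at such power levels.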

Not all the output pulses will be due to the arrival of a photon, as there are bound to be
spurious noise pulses present. These may be allowed for in two ways. First, if the signal
is switched on for a given time and then switched off for the same length of time we would
expect approximately the same number of noise pulses in both time intervals, and hence
a simple subtraction of the pulse counts obtained over the two time periods should enable
a better signal count rate to be obtained. Secondly, the noise pulses are likely to have a much
wider range of pulse heights than the signal pulses; for example, Fig. 7.54 shows a pulse
height distribution containing both signal and noise from a photomultiplier specifically
designed for photon counting. By counting only those pulses which lie between the limits
expected for the signal pulses, a number of noise pulses will be rejected. When
avalanche photodiodes are used for photon counting, it is usual to cool them to reduce
thermally generated noise pulses, and to operate them with a reverse bias at or just above that
normally required for breakdown to ensure that the arrival of a single photon will generate
the largest possible output pulse. However, a quenching circuit is also required to
prevent the subsequent continuous avalanche breakdown that would normally follow such
an event.

FIG. 7.54 Signal pulses have a relatively restricted range of pulse heights, whereas noise pulses show a
much wider range of pulse heights. Discriminator levels can therefore be set to accept most of the signal
pulses while a large proportion of the noise pulses will be rejected.
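
The two noise-rejection ideas described above (subtracting a signal-off count from a signal-on count, and accepting only pulses between two discriminator levels) can be sketched as follows. This is a minimal sketch: the function names and the pulse heights are hypothetical and are not measured data.

```python
def net_signal_count(count_signal_on, count_signal_off):
    """Subtract the noise-only count from the signal-plus-noise count.

    Both counts are assumed to be taken over equal time intervals.
    """
    return count_signal_on - count_signal_off


def count_in_window(pulse_heights, lower_level, upper_level):
    """Count the pulses whose height lies between the two discriminator levels."""
    return sum(1 for h in pulse_heights if lower_level <= h <= upper_level)


if __name__ == "__main__":
    # Hypothetical pulse heights in arbitrary units, for illustration only.
    heights = [0.2, 0.9, 1.0, 1.1, 3.5, 0.95, 0.1, 1.05]
    print(count_in_window(heights, 0.8, 1.2))
    print(net_signal_count(1250, 250))
```

The window limits would in practice be chosen from a measured pulse height distribution such as that of Fig. 7.54.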

One problem with this scheme arises from the finite response time ($t_s$) of the detection
system. The output pulses will have a finite time width, and if the pulse rate ($R_p$) is too high
then there is the danger that two pulses will arrive within a time equal to the response time
and the two signal pulses will then merge into a single large output pulse and be either
discounted or miscounted. This will happen when $t_s \approx 1/R_p$. This places an upper limit on the
signal power that can be detected. An additional problem with the avalanche photodiode
quenching circuitry is that there will be a period of time following a detected pulse when
the detector is inactive.
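
The upper limit on signal power can be estimated by setting the pulse rate equal to the reciprocal of the response time and inverting the pulse rate expression. The sketch below does this under the assumption that pile-up sets in at $R_p = 1/t_s$ exactly; the function name and the example values (10 ns response time, 500 nm, quantum efficiency 0.2) are illustrative assumptions.

```python
H_PLANCK = 6.62607015e-34  # Planck constant, J s
C_LIGHT = 2.99792458e8     # speed of light in vacuum, m/s


def max_signal_power(response_time_s, wavelength_m, quantum_efficiency):
    """Optical power at which the pulse rate R_p reaches 1 / t_s.

    Inverts R_p = eta * P * lambda_0 / (h c) with R_p = 1 / t_s.
    """
    max_pulse_rate = 1.0 / response_time_s
    photon_energy_j = H_PLANCK * C_LIGHT / wavelength_m
    return max_pulse_rate * photon_energy_j / quantum_efficiency


if __name__ == "__main__":
    # Illustrative (assumed) values only.
    p_max = max_signal_power(10e-9, 500e-9, 0.2)
    print(f"approximate upper power limit: {p_max:.2e} W")
```

In practice the usable power would be kept well below this estimate, since pulses arrive at random and some overlap occurs even at lower rates.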

7.3.10 Solar cells

One of the more obvious sources of renewable energy is the radiation from the sun. Just
outside the earth's atmosphere the irradiance is about 1400 W m⁻². Because of attenuation
in the atmosphere, this figure falls to between 500 W m⁻² and 1000 W m⁻² on the surface
of the earth, depending on the angle of incidence of the radiation. The peak in the solar
spectrum is at about 0.6 µm and it has a useful wavelength range of about 0.3 µm to 2 µm. The
aim of the solar cell is to convert as much of this radiation as possible into electrical energy.
In essence the solar cell is simply a p-n junction detector operated under conditions such
that it can deliver power into an external load. The full i-V characteristics of a p-n junction
under illumination are sketched in Fig. 7.55. The solar cell operates in the quadrant where $i$
is negative and $V$ is positive. Most power will be delivered when the product $i \times V$ is a
maximum and this determines the optimum load impedance across the cell. It is fairly evident
from Fig. 7.55 that we require both the open circuit voltage ($V_{oc}$) and the short circuit current
($i_{sc}$) to be as large as possible. From eq. (7.29) we see that the open circuit voltage is
proportional to the factor $\ln[\eta I_0 A e \lambda_0/(h c\, i_0)]$, and thus we need to make $i_0$, the
reverse bias leakage current, as small as possible. Reference to eq. (2.51a) shows that this may
be achieved by making the minority carrier populations (i.e. $n_p$ and $p_n$) as small as possible.
This implies that the doping on either side of the junction should be as large as possible. We saw
in our analysis of the p-n junction photodetector that a signal could be obtained not only from
photons that were absorbed within the depletion region but also from electrons and holes that had
diffused to the depletion region after being generated outside it. In photodetectors this addition
to the output was not regarded as advantageous since it increased the response time unduly.
In the present instance, however, we are not at all concerned with response times and so the
additional current flow is useful, especially as it is likely to arise from absorption of long
wavelength photons where the absorption coefficient of the semiconductor may be relatively low.
To increase this contribution the diffusion length of the carriers should be as large as
possible. Since $L_e = (D_e \tau_e)^{1/2}$ this requires long minority carrier lifetimes. Unfortunately
large values for the minority carrier lifetimes imply low values for the impurity doping
concentrations (see eq. 4.16) and hence a compromise on the doping concentrations has to be
reached. The most common structure for silicon solar cells, for example, is n⁺-p-p⁺.

FIG. 7.55 Current-voltage characteristics of a p-n junction solar cell under various levels of
illumination. The dark characteristic is that of an ordinary p-n junction diode. Under increasing
levels of illumination the curve is progressively shifted downwards. The intercepts of the curves
with the V = 0 axis give the values for the short circuit currents ($i_{sc}$), whilst the intercepts
with the i = 0 axis give the values for the open circuit voltages ($V_{oc}$).

If the maximum power is obtained when the current is $i_m$ and when the voltage is $V_m$ then
the fill factor $F$ is defined by

\[ F = \frac{V_m \times i_m}{V_{oc} \times i_{sc}} \tag{7.19} \]

In a good cell design, fill factors of as high as 0.8 can be obtained. The upper surface of a 
solar cell usually has a 'finger' network of ohmic contacts on it for efficient collection of 
the current. Typically these cover some 5-10% of the surface and their design needs some 
care to minimize the effective resistance that the electrons encounter as they travel from the 
junction to the external circuitry. 
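
To make the fill factor definition concrete, the sketch below evaluates it numerically for an ideal junction. It assumes the ideal diode relation $i = i_0[\exp(eV/kT) - 1] - i_\lambda$ for the illuminated cell, which is a modelling assumption here rather than something quoted from this section, and it ignores series and shunt resistance. The function names and the example currents (photocurrent 0.1 A, leakage current 1 nA) are illustrative.

```python
import math

K_BOLTZMANN = 1.380649e-23   # J/K
E_CHARGE = 1.602176634e-19   # C


def delivered_current(voltage, photo_current, leakage_current, temperature_k=300.0):
    """Magnitude of the current delivered to the load by an ideal illuminated junction."""
    v_thermal = K_BOLTZMANN * temperature_k / E_CHARGE
    return photo_current - leakage_current * (math.exp(voltage / v_thermal) - 1.0)


def open_circuit_voltage(photo_current, leakage_current, temperature_k=300.0):
    """Voltage at which the delivered current falls to zero."""
    v_thermal = K_BOLTZMANN * temperature_k / E_CHARGE
    return v_thermal * math.log(photo_current / leakage_current + 1.0)


def fill_factor(photo_current, leakage_current, temperature_k=300.0, steps=20000):
    """Scan the i-V curve for the maximum power point and return F = V_m i_m / (V_oc i_sc)."""
    v_oc = open_circuit_voltage(photo_current, leakage_current, temperature_k)
    i_sc = delivered_current(0.0, photo_current, leakage_current, temperature_k)
    max_power = 0.0
    for k in range(steps + 1):
        v = v_oc * k / steps
        power = v * delivered_current(v, photo_current, leakage_current, temperature_k)
        max_power = max(max_power, power)
    return max_power / (v_oc * i_sc)


if __name__ == "__main__":
    # Illustrative (assumed) values: photocurrent 0.1 A, leakage current 1 nA, 300 K.
    print(f"V_oc = {open_circuit_voltage(0.1, 1e-9):.3f} V")
    print(f"F    = {fill_factor(0.1, 1e-9):.3f}")
```

Reducing the assumed leakage current raises both the open circuit voltage and the fill factor in this model, in line with the argument for making $i_0$ as small as possible.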

The first requirement of the solar cell material is that it has a bandgap which allows the 
absorption of a substantial part of the useful solar spectrum. The material must also exhibit 
long carrier diffusion lengths (in other words, high mobilities and long carrier lifetimes). 
As well as silicon there are several potential candidates within the II-VI and III-V groups
of compounds, for example gallium arsenide, cadmium telluride and copper indium dise- 
lenide. Although these latter materials are in general more difficult to process than silicon 
they can be made into more varied and efficient structures (e.g. it is possible to have 'cas- 
cade' structures so that the incoming radiation encounters a series of cells with decreasing 
bandgaps which progressively remove longer and longer wavelengths). The expense involved 
in manufacturing such efficient structures can only be justified in space applications and in 
circumstances where the light is concentrated using some sort of lens or mirror structure so 
that only a relatively small area of solar cell is required. 

At present, most commercially available cells are made from silicon, which can be in single
crystal, polycrystalline or amorphous form. Of these the single crystal cells are generally
the most efficient (10-12%), whilst amorphous cells are less efficient and also suffer a fall in
the efficiency, over a period of time, to some 80% of the initial value.

NOTES ___ 



i. 



In contrast to the situation in eqs (7 P) and n u, ,u,\- • ,. 

explicitly involves $f$, and hence we have to refer to the noise within the infinitesimally
narrow frequency interval d$f$.

Strictly speaking this relationship only applies if the sv^m ™„ r ■ 



A lead sulfide photodetector has a sensitive -ire-, nf in-* ^ , ■ 

ra d,ano„ of 2 Mm wavelength. Us ina F 7 '„*" L™ "I," """ '" ***' 

7 2 Calculate the wavelength at which the enerirv nf a „i, , u 

? ' 2 a vera ge ,hen,,a, e„e rgy of a,o ms ,„ . ^ f^ LClt .^ ^ "' "" 

7.3 Starting from eq. (7.1), that is

\[ W = H\frac{\mathrm{d}(\Delta T)}{\mathrm{d}t} + G\,\Delta T \]

which represents the energy balance condition in a thermal detector, and
assuming that $W$ and $\Delta T$ may be written as $W = W_0 + W_f\cos(2\pi f t)$ and
$\Delta T = \Delta T_0 + \Delta T_f\cos(2\pi f t + \phi_f)$, show that:

(a) $\Delta T_0 = W_0/G$; and

(b) $\Delta T_f = W_f/(G^2 + 4\pi^2 f^2 H^2)^{1/2}$.

7.4 Show that the effective thermal conductance $G_R$ for a radiative link at absolute
temperature $T$ may be written

\[ G_R = 4\sigma A \varepsilon T^3 \]

where $\sigma$ is Stefan's constant, $\varepsilon$ the receiver surface emissivity and $A$ its area. Hence
show that the noise power fluctuations $(\Delta W_f)_R$ in a detector of bandwidth $\Delta f$, limited
by radiative exchange, is given by

\[ (\Delta W_f)_R = 4(A\varepsilon k T^5 \sigma\,\Delta f)^{1/2} \]

Assuming $\varepsilon = 1$, calculate $(\Delta W_f)_R$ for a detector of area 100 mm² and bandwidth 1 Hz
at room temperature (300 K).

7.5 One fairly easily made bolometer consists of a long length of fine wire
(usually copper) inside an insulating enclosure; it is
sometimes known as a 'rat's nest' calorimeter. Show that the fractional change in the
resistance of the wire (length $L$ and radius $r$) when a heat pulse of energy $H$ joules is
input can be written

\[ \frac{\Delta R}{R} = \frac{H\alpha}{\pi r^2 L \rho S} \]

where $\rho$ is the density of the metal, $\alpha$ its temperature coefficient of resistance and $S$
its specific heat.

As mentioned in section 7.1.2 the change in resistance may be measured by
incorporating the bolometer element into a Wheatstone bridge circuit (Fig. 7.4).
Determine how the current through the galvanometer depends on the voltage applied
across the bridge and the magnitude of the heat pulse (assume for simplicity that the
resistances in the arms of the bridge are all equal before the heat pulse arrives).
Hence determine the responsivity of such a device made from 100 m of fine copper
wire given the following data for copper:

Density ($\rho$): 8930 kg m⁻³

Specific heat ($S$): 384 J K⁻¹ kg⁻¹

Resistivity: 1.7 × 10⁻⁸ Ω m

Temperature coefficient of resistance ($\alpha$): 3.9 × 10⁻³ K⁻¹

7.6 The equivalent circuit of a pyroelectric detector may be taken to be a current source $i$
feeding into a parallel combination of a capacitor $C$ and a load resistor $R_L$ (see Fig.
7.7b).

If $P$ is the dipole moment per unit volume of the pyroelectric material, show that
the surface charge is $PA$ where $A$ is the surface area. Hence show that $i$ can be written

\[ i = A\,\frac{\mathrm{d}P}{\mathrm{d}T}\cdot\frac{\mathrm{d}T}{\mathrm{d}t} \]

where $T$ is the temperature of the detector and $\mathrm{d}T/\mathrm{d}t$ the rate of increase of
temperature with time due to the absorption of radiation.

By assuming the results of Problem 7.3, show that if the detector completely
absorbs radiation of irradiance $W_0 + W_f\cos(2\pi f t)$ then the voltage responsivity of the
device $R_v$ can be written

\[ R_v = A\,\frac{\mathrm{d}P}{\mathrm{d}T}\,\frac{R_L}{G}\,\frac{2\pi f}{(1 + 4\pi^2 f^2 H^2/G^2)^{1/2}\,(1 + 4\pi^2 f^2 R_L^2 C^2)^{1/2}} \]

Sketch the behaviour of $R_v$ with $f$.

7.7 Estimate the minimum photon flux required for a television scanning system using a 
photomultiplier tube with a quantum efficiency of 0.1. The signal bandwidth is 
5 MHz and a signal-to-noise ratio of 100 is required. 

7.8 Show that when the simple bias circuit shown in Fig. 7.17 is used with a
photoconductive detector to detect small signal levels, then maximum voltage output
signals across $R_L$ are obtained when $R_L$ is equal to the detector resistance $R_D$.

7.9 Show that the gain $G$ of a photoconductive detector can be written as $G = \tau_c/\tau_d$,
where $\tau_c$ is the minority carrier lifetime, and $\tau_d$ is given by

\[ \frac{1}{\tau_d} = \frac{1}{\tau_n} + \frac{1}{\tau_p} \]

$\tau_n$ and $\tau_p$ being the times taken for electrons and holes to drift across the photodetector.
7.10 A photoconductive detector is used in conjunction with a load resistor $R_L$ and a
detection circuit which operates over a frequency range of 0 to $f_m$ Hz. Show that
the r.m.s. power fluctuation due to generation-recombination noise is given by

\[ \Delta W_{gr} = \frac{2 i e R_L G}{\pi \tau_c}\tan^{-1}(2\pi f_m \tau_c) \]

where $i$ is the photoconductive current, $G$ the photoconductive gain and $\tau_c$ the
minority carrier lifetime.

Optical power of 1 µW at a wavelength of 0.9 µm falls on a photoconductive
detector which is used in conjunction with a load resistor of 20 kΩ and which has a
quantum efficiency of 0.7. The thickness of the photoconductive layer is 10 µm and
in operation a voltage of 10 V is applied across it. Assuming the values $\tau_c = 0.5$ ns,
$\mu_e = 0.25$ m² V⁻¹ s⁻¹ and $\mu_h = 0.05$ m² V⁻¹ s⁻¹, calculate the noise power expected
from generation-recombination noise when the detector bandwidth is 50 MHz.
Compare this with the magnitude of the shot noise power which will be present.
7.11 An ideal photodiode (of unit quantum efficiency) is illuminated with 10 mW of
radiation at 0.8 µm wavelength; calculate the current and voltage output when the
detector is used in the photoconductive and photovoltaic modes respectively. The
reverse bias leakage current is 10 nA.

7.12 Show that the field within the depletion region in a typical p-i-n photodiode
detector is such as to give rise to a saturation electron velocity (assume a linear
change in field with distance across the depletion region).

7.13 It is desired to make a photodiode based on Ga$_x$In$_{1-x}$As$_y$P$_{1-y}$ grown on an InP
substrate. Determine a suitable composition given that it is to be used at 1.55 µm;
you may assume that lattice matching occurs when $y = 2.16x$, and that the
lattice-matched bandgap is given by

\[ E_g\ (\text{in eV}) = 1.36 - 0.72y + 0.12y^2 \]

7.14 Show that the responsivity of a photodiode detector as a function of signal
modulation frequency $f$ can be written as

\[ R(f) = \frac{R(0)}{[1 + (2\pi f C_d R_L)^2]^{1/2}} \]

where $C_d$ is the diode capacitance and $R_L$ the load resistance.

7.15 It is desired to make a silicon p-i-n photodiode with an area of 1 mm² with as fast
a response time as possible when used in conjunction with a 50 Ω load resistor.
Estimate the thickness of the intrinsic region required. Take $\varepsilon_r = 11.8$ and $v_s$ (the
electron saturation velocity) as 10⁵ m s⁻¹.
Over what wavelength range would you expect the device to be most effective?

7.16 When both capacitance effects and carrier transit times across the depletion region
contribute to the frequency response of a photodiode, the resultant time response, $\tau$,
may be written as

\[ \tau^2 = \tau_c^2 + \tau_t^2 \]

where $\tau_c$ and $\tau_t$ are the response times corresponding to capacitance effects and
transit time effects respectively. By considering the dependence of these response
times on the depletion region thickness show that the optimum width for the
depletion region in a p-i-n photoconductor as far as response time is concerned is
obtained when $\tau_c = \tau_t$.

Fiber optical waveguides

It has long been realized that light, with its carrier frequency of some 10¹⁴ Hz, has the
potential to be modulated at much higher frequencies than either radio or microwaves, and thus
opens up the possibility of a single communication channel with extremely high information
content. One of the main difficulties in implementing this was that without some sort
of guiding medium, any form of atmospheric transmission was limited to line-of-sight
communications and was thus subject to the vagaries of the weather. As long ago as 1870 Tyndall
demonstrated that light could be guided within a water jet as a result of total internal
reflection (ref. 8.1). However, although some theoretical studies were carried out in the early
years of the present century, it was not until the mid-1960s that the idea of a communication
system based on the propagation of light within circular dielectric waveguides was considered
seriously (ref. 8.2). One of the reasons for this delay was that initially it was assumed the
waveguides would be a dielectric rod which would carry a single electromagnetic mode. Theory
showed that such guides would have an extremely small diameter and furthermore the mode
fields would penetrate into the air surrounding the rod, giving rise to high losses and making
it difficult to support the guide. These problems were overcome with the proposal in 1954
(ref. 8.3) to use cladded dielectric waveguides. The basic structure of such a guide is shown
in Fig. 8.1. It consists of a central core region surrounded by a cladding, where the core
material has a higher refractive index than the cladding. To appreciate how light may travel
down the waveguide we consider a ray which passes through the centre of the guide and
hits the core/cladding interface at an angle that gives rise to total internal reflection (see
eq. 1.14). The ray is then able to travel in a zigzag path down the core of the guide as shown
in Fig. 8.1.



FIG. 8.1 The basic structure of an optical fiber. A core region is surrounded by a cladding
region where the refractive index of the core material is greater than that of the cladding.
(Diagram labels: cladding, light ray, core.)

Although a full treatment of the propagation of light down such circular waveguides (or
optical fibers as they are often called) requires the solution of Maxwell's equations, which
is beyond the scope of the present text, it is possible to gain a reasonable understanding by
using simple ray theory and to this end we take a closer look at the phenomenon of total
internal reflection.



8.1 Total internal reflection

When an electromagnetic wave is incident upon the boundary between two dielectric media
whose refractive indices are $n_1$ and $n_2$, then in general a portion of that wave is reflected
and the remainder transmitted. Maxwell's equations require that both the tangential
components of $\mathcal{E}$ and $\mathcal{H}$ and the normal components of $D$ ($= \varepsilon_r\varepsilon_0\mathcal{E}$) and
$B$ ($= \mu_r\mu_0\mathcal{H}$) are continuous across the boundary. A detailed consideration of the
consequences of applying these conditions is too lengthy to be reproduced here; however, this
is quite straightforward and is given in many textbooks (see e.g. ref. 8.4). The resulting
equations are known as Fresnel's equations and are summarized below. We suppose the wave to be
incident on the interface at an angle $\theta_i$ to the normal and that the reflected and transmitted
waves are at angles $\theta_r$ and $\theta_t$ respectively, as shown in Fig. 8.2. These angles are related
by the equations

\[ \theta_i = \theta_r \quad\text{and}\quad \frac{\sin\theta_i}{\sin\theta_t} = \frac{n_2}{n_1} \tag{8.1} \]



Fresnel's equations deal with the magnitudes of the transmitted and reflected electric fields
($\mathcal{E}_t$, $\mathcal{E}_r$) relative to the incident field $\mathcal{E}_i$. We must distinguish between the cases
where the electric field vector is parallel to and perpendicular to the plane of incidence. We
denote these field components by $\mathcal{E}^{\parallel}$ and $\mathcal{E}^{\perp}$. It should be noted that later on we
use the notations TM (Transverse Magnetic) and TE (Transverse Electric) to denote these two
polarization situations. This latter notation arises because for TM polarization the magnetic
field always remains perpendicular to the plane of incidence, whilst for TE polarization it is
the electric field which always remains perpendicular to the plane of incidence. Fresnel's
equations can then be written

\[ \frac{\mathcal{E}_r^{\perp}}{\mathcal{E}_i^{\perp}} = \frac{n_1\cos\theta_i - n_2\cos\theta_t}{n_1\cos\theta_i + n_2\cos\theta_t} \quad\text{and}\quad \frac{\mathcal{E}_r^{\parallel}}{\mathcal{E}_i^{\parallel}} = \frac{n_2\cos\theta_i - n_1\cos\theta_t}{n_1\cos\theta_t + n_2\cos\theta_i} \tag{8.1a} \]

\[ \frac{\mathcal{E}_t^{\perp}}{\mathcal{E}_i^{\perp}} = \frac{2n_1\cos\theta_i}{n_1\cos\theta_i + n_2\cos\theta_t} \quad\text{and}\quad \frac{\mathcal{E}_t^{\parallel}}{\mathcal{E}_i^{\parallel}} = \frac{2n_1\cos\theta_i}{n_1\cos\theta_t + n_2\cos\theta_i} \tag{8.1b} \]

FIG. 8.2 Illustration of the behaviour of a light ray incident on the boundary between two media with
refractive indices $n_1$ and $n_2$ where $n_1 > n_2$. In general, a transmitted and a reflected beam are produced.
The angles of incidence, transmission and reflection are $\theta_i$, $\theta_t$ and $\theta_r$, respectively.



We will be particularly interested in the situation when $n_1 > n_2$, since it is then possible to
have total internal reflection (see section 4.6.4); that is, when $\theta_i > \theta_c$ where

\[ \theta_c = \sin^{-1}(n_2/n_1) \tag{8.2} \]

there is no transmitted wave in the second medium. Further confirmation that total internal
reflection occurs may be obtained as follows: we have that

\[ \cos\theta_t = (1 - \sin^2\theta_t)^{1/2} \]

and substitution for $\sin^2\theta_t$ from eq. (8.1) gives

\[ \cos\theta_t = \left[1 - (n_1/n_2)^2\sin^2\theta_i\right]^{1/2} \]

When $\theta_i > \theta_c$, $\sin\theta_i > n_2/n_1$ and $\cos\theta_t$ then becomes wholly imaginary. We may therefore
write

\[ \cos\theta_t = \pm iB \quad (\theta_i > \theta_c) \tag{8.3} \]

where

\[ B = \left[(n_1/n_2)^2\sin^2\theta_i - 1\right]^{1/2} \]

We must take the negative sign in eq. (8.3) for reasons which will be explained later (see
eq. 8.7 and the subsequent discussion).

Putting $A = (n_1/n_2)\cos\theta_i$ and substituting for $A$ and $B$ into the first equation of eq. (8.1a),
we obtain

\[ \frac{\mathcal{E}_r^{\perp}}{\mathcal{E}_i^{\perp}} = \frac{A + iB}{A - iB} \tag{8.4} \]

Inspection of the right-hand side of eq. (8.4) shows that it has a modulus of unity, which
indicates that when $\theta_i > \theta_c$ the irradiances of the reflected and the incident beams are equal
and thus there is no transmitted beam. In other words we have total internal reflection. Since
the electric field ratios are now complex quantities, however, a phase shift is present
between the incident and reflected beams, which we now calculate. We may rewrite eq.
(8.4) as

\[ \frac{\mathcal{E}_r^{\perp}}{\mathcal{E}_i^{\perp}} = \frac{\cos\psi + i\sin\psi}{\cos\psi - i\sin\psi} = \frac{\exp(i\psi)}{\exp(-i\psi)} = \exp(2i\psi) \tag{8.5} \]

where

\[ \tan\psi = B/A = \frac{n_2\left[(n_1/n_2)^2\sin^2\theta_i - 1\right]^{1/2}}{n_1\cos\theta_i} \]

or

\[ \tan\psi = \frac{\left[\sin^2\theta_i - (n_2/n_1)^2\right]^{1/2}}{\cos\theta_i} \tag{8.5a} \]

Similarly the second equation of eq. (8.1a) yields

\[ \frac{\mathcal{E}_r^{\parallel}}{\mathcal{E}_i^{\parallel}} = \exp(2i\delta) \tag{8.6} \]

where

\[ \tan\delta = (n_1/n_2)^2\tan\psi \tag{8.6a} \]

The phase changes on reflection for $\mathcal{E}^{\perp}$ and $\mathcal{E}^{\parallel}$ are thus given by $2\psi$ and $2\delta$
respectively; in both cases $\mathcal{E}_r$ leads $\mathcal{E}_i$ in phase. The variation of both these phase changes
as a function of $\theta_i$ is shown for a glass/air interface (i.e. $n_1 = 1.5$, $n_2 = 1$) in Fig. 8.3.
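
The results of eqs (8.4) to (8.6a) can be checked numerically. The sketch below evaluates the reflection ratios of eq. (8.1a) with $\cos\theta_t = -iB$ from eq. (8.3) and compares their phases with $2\psi$ and $2\delta$. The function names are illustrative, and the form used for the parallel component is the one that reproduces eq. (8.6); treat it as an assumption about the sign convention.

```python
import cmath
import math


def tir_phase_shifts(n1, n2, theta_i_deg):
    """Return (2*psi, 2*delta) in radians from tan(psi) = B/A and eq. (8.6a)."""
    theta_i = math.radians(theta_i_deg)
    b_squared = (n1 / n2) ** 2 * math.sin(theta_i) ** 2 - 1.0
    if b_squared <= 0.0:
        raise ValueError("angle of incidence must exceed the critical angle")
    a = (n1 / n2) * math.cos(theta_i)
    psi = math.atan2(math.sqrt(b_squared), a)
    delta = math.atan((n1 / n2) ** 2 * math.tan(psi))
    return 2.0 * psi, 2.0 * delta


def reflection_ratios(n1, n2, theta_i_deg):
    """Complex ratios (perpendicular, parallel) from eq. (8.1a) with cos(theta_t) = -iB."""
    theta_i = math.radians(theta_i_deg)
    b_squared = (n1 / n2) ** 2 * math.sin(theta_i) ** 2 - 1.0
    if b_squared <= 0.0:
        raise ValueError("angle of incidence must exceed the critical angle")
    cos_i = math.cos(theta_i)
    cos_t = -1j * math.sqrt(b_squared)
    r_perp = (n1 * cos_i - n2 * cos_t) / (n1 * cos_i + n2 * cos_t)
    r_par = (n2 * cos_i - n1 * cos_t) / (n2 * cos_i + n1 * cos_t)
    return r_perp, r_par


if __name__ == "__main__":
    two_psi, two_delta = tir_phase_shifts(1.5, 1.0, 60.0)
    r_perp, r_par = reflection_ratios(1.5, 1.0, 60.0)
    print(abs(r_perp), abs(r_par))                      # both moduli should be unity
    print(math.degrees(two_psi), math.degrees(cmath.phase(r_perp)))
    print(math.degrees(two_delta), math.degrees(cmath.phase(r_par)))
```

Both moduli come out as unity beyond the critical angle, and the phases of the complex ratios agree with $2\psi$ and $2\delta$, which is the content of eqs (8.5) and (8.6).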

Although all the energy in the beam is reflected when $\theta_i > \theta_c$, there is still a disturbance
in the second medium whose electric field amplitude decays exponentially with distance away
from the boundary. No energy is conveyed away from the surface provided the second
medium extends an infinite distance from the boundary. We may derive an expression for
this decay by considering the phase factor $\mathcal{P}$ of the transmitted wave, which at a point $\mathbf{r}$ we
may write as

\[ \mathcal{P} = \exp[i(\omega t - \mathbf{k}_t\cdot\mathbf{r})] \]

where $\mathbf{k}_t$ is the wavevector associated with the transmitted wave. Reference to Fig. 8.4 shows
that, in terms of the coordinates $y$ and $z$, we may write $r$ as $z\sin\theta_t + y\cos\theta_t$, and hence
we have

\[ \mathcal{P} = \exp\left\{ i\left[\omega t - \frac{2\pi n_2}{\lambda_0}(z\sin\theta_t + y\cos\theta_t)\right]\right\} \]

where $\lambda_0$ is the wavelength of the radiation in vacuum. If we substitute expressions for $\sin\theta_t$
and $\cos\theta_t$ from eqs (8.1) and (8.3), we obtain

\[ \mathcal{P} = \exp\left(\pm\frac{2\pi n_2 B}{\lambda_0}\,y\right)\exp\left\{ i\left[\omega t - \frac{2\pi n_1\sin\theta_i}{\lambda_0}\,z\right]\right\} \tag{8.7} \]

Thus, in the $y$ direction the wave either grows or decays exponentially with distance. The
former situation is obviously a non-physical solution and we must choose $\cos\theta_t = -iB$ in eq.
(8.3).

The decay with distance in the second medium is given by the factor $F(y)$ where

\[ F(y) = \exp\left(-\frac{2\pi n_2 B}{\lambda_0}\,y\right) = \exp\left\{-\frac{2\pi n_2}{\lambda_0}\left[\left(\frac{n_1}{n_2}\right)^2\sin^2\theta_i - 1\right]^{1/2} y\right\} \tag{8.8} \]

FIG. 8.3 The variation of the phase shift on reflection as a function of the angle of incidence at an
interface where there is total internal reflection.

FIG. 8.4 Illustration of the relationship between the rectangular coordinates $y$ and $z$ and the distance $r$
taken from the origin O. We have OB = OD + DB = OC sin θ + BC cos θ. Hence, in terms of the
coordinates $y$ and $z$, $r = z\sin\theta + y\cos\theta$.



Usually, $F(y)$ decays rapidly with $y$ (see Example 8.1). However, when $\theta_i$ is very close to
$\theta_c$, then $[(n_1/n_2)^2\sin^2\theta_i - 1]^{1/2}$ will be close to zero and the disturbance may extend an
appreciable distance into the second medium. That part of the field which is in the second medium
is referred to as the evanescent field.



EXAMPLE 8.1 Field penetration into the less dense medium during total internal reflection

If we again take a glass/air interface, $n_1 = 1.5$, $n_2 = 1$, with $\theta_i = 60°$, from eq. (8.8) we
obtain

\[ F(y) = \exp(-5.2\,y/\lambda_0) \]

Thus, in a distance equal to the wavelength, the magnitude of the electric field will
fall by a factor exp(−5.2) or 5.5 × 10⁻³. If, however, we take a value for $\theta_i$ which is
much closer to $\theta_c$ (= 41.8°), say 42°, then we have

\[ F(y) = \exp(-0.54\,y/\lambda_0) \]

As a function of distance the decay is now less rapid than before, and in fact the field
decay factor at a distance $y = \lambda_0$ is now exp(−0.54) or 0.58.
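
The numbers in Example 8.1 can be reproduced directly from eq. (8.8). The sketch below computes the critical angle and the coefficient $c$ in $F(y) = \exp(-c\,y/\lambda_0)$; the function names are illustrative and not taken from the text.

```python
import math


def critical_angle_deg(n1, n2):
    """Critical angle from eq. (8.2), in degrees."""
    return math.degrees(math.asin(n2 / n1))


def decay_coefficient(n1, n2, theta_i_deg):
    """Coefficient c such that F(y) = exp(-c * y / lambda_0), from eq. (8.8)."""
    sin_i = math.sin(math.radians(theta_i_deg))
    b_squared = (n1 / n2) ** 2 * sin_i ** 2 - 1.0
    if b_squared < 0.0:
        raise ValueError("angle of incidence is below the critical angle")
    return 2.0 * math.pi * n2 * math.sqrt(b_squared)


if __name__ == "__main__":
    print(f"critical angle: {critical_angle_deg(1.5, 1.0):.1f} degrees")
    for angle in (60.0, 42.0):
        c = decay_coefficient(1.5, 1.0, angle)
        print(f"theta_i = {angle:4.1f} deg: c = {c:.2f}, F(lambda_0) = {math.exp(-c):.2g}")
```

For the glass/air interface this gives a coefficient of about 5.2 at 60° and about 0.54 at 42°, so the field at $y = \lambda_0$ falls to roughly 5.5 × 10⁻³ and 0.58 of its boundary value respectively.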



8.2 Planar dielectric waveguide

At this stage it is useful to examine one of the simplest forms of waveguide, namely the planar
symmetric dielectric waveguide. There are two main reasons for this. First, it is possible to
make considerable progress in understanding the behaviour of these guides using ray theory,
and secondly many of the resulting ideas can be applied, with relatively minor modifications,
to circular waveguides. In addition planar waveguides are of practical importance in
their own right: for example, they form the basis of the waveguides used in integrated
optoelectronics (see section 9.4). The waveguide itself consists of a semi-infinite slab of
dielectric of thickness $d$ and refractive index $n_1$ sandwiched between two regions both of
refractive index $n_2$, where $n_1 > n_2$ (Fig. 8.5). The central slab is called the core whilst the
surrounding regions are called the cladding regions. A ray of light may readily propagate down
the core by taking the zigzag path as shown in Fig. 8.6, provided that total internal reflection
occurs at the core/cladding interfaces. Thus we require that $90° > \theta > \theta_c$, where $\theta$ is the
internal ray angle (note that in the interests of notational simplicity, from now on we refer to
the internal ray angle as $\theta$ rather than $\theta_i$).

Although in Fig. 8.6 we have drawn only one ray, we must realize that in fact an 'infinite'
number of such rays, all slightly displaced from each other, will also be propagating down




FIG. 8.5 Structure of a planar symmetric dielectric waveguide. A layer of refractive index $n_1$ is
sandwiched between layers of refractive index $n_2$, where $n_1 > n_2$.

FIG. 8.6 The zigzag path of a light ray down a planar dielectric waveguide that results when the
angle of incidence at the boundary, $\theta_1$, is greater than the critical angle.
the guide. Actually, the rays merely represent lines drawn normally to the plane wavefronts.
Figure 8.7(a) shows a wavefront for the situation where the ray angle, $\theta$, is less than $45^\circ$ whilst
Fig. 8.7(b) shows the situation where $\theta$ is greater than $45^\circ$. We deal with the former situation
first. All points along the same wavefront must have identical phases and if we consider
the particular wavefront, FC, drawn in Fig. 8.7(a), we see that it intersects two of the
upwardly travelling portions of the same ray at the points A and C. Unless the phase at the
points C and A is the same, or differs by a multiple of $2\pi$, destructive interference will take
place, making it impossible for light to propagate down the guide. To calculate the phase
difference between A and C we must take into account two factors, first the path length of
AB + BC and secondly the phase changes due to reflection at B and C. For convenience we
may write the phase change on reflection simply as $\phi(\theta)$ where, for TE radiation, $\phi(\theta) = 2\psi$
and, for TM radiation, $\phi(\theta) = 2\delta$ (see eqs 8.5 and 8.6). Thus the total phase change may be
written



$$(\mathrm{AB} + \mathrm{BC})\,\frac{2\pi n_1}{\lambda_0} - 2\phi(\theta)$$
FIG. 8.7 Ray paths having the same internal angle $\theta$ within a planar waveguide. A wavefront is shown
connecting the points F, E, A, C, which must therefore have the same phase.
Note that the two phase terms have opposite sign. This is because the basic equation of a
wave, $A = A_0\cos(\omega t - kr)$, implies that the phase at the point $r_2$ will be less than that at a point
$r_1$ if $r_2 > r_1$, that is the phase decreases with increasing $r$, whereas when a wave suffers total
internal reflection its phase increases (see eqs 8.5 and 8.6).

From the triangle ABC we have that $\mathrm{AB} = \mathrm{BC}\cos 2\theta$, and hence $\mathrm{AB} + \mathrm{BC} = \mathrm{BC}(1 + \cos 2\theta)$
or, since $\cos 2\theta = 2\cos^2\theta - 1$, $\mathrm{AB} + \mathrm{BC} = 2\,\mathrm{BC}\cos^2\theta$. Also, from the triangle BDC we have
$\mathrm{BC}\cos\theta = d$, so that finally $\mathrm{AB} + \mathrm{BC} = 2d\cos\theta$.

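Setting the total phase change equal to $2m\pi$ and dividing through by two, the condition for
the mode to propagate is therefore

$$\frac{2\pi n_1 d\cos\theta}{\lambda_0} - \phi(\theta) = m\pi \qquad (8.9)$$

where $m$ is an integer.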

So far the discussion has only covered the situation where the internal ray angle in the
guide is less than $45^\circ$. Turning to Fig. 8.7(b), we again require that the phases at A and C
be equal, but this time the distance AB represents a negative path length since the ray would
have to travel 'backwards' along AB to get from A to B. Thus the total phase change between
A and C can be written

$$(\mathrm{BC} - \mathrm{AB})\,\frac{2\pi n_1}{\lambda_0} - 2\phi(\theta)$$

It is left as an exercise for the reader to show that in Fig. 8.7(b) $(\mathrm{BC} - \mathrm{AB})$ is equal to $2d\cos\theta$,
so that in fact the phase condition represented by eq. (8.9) applies for all values of the internal
ray angle $\theta$.

For each value of $m$ there will be a corresponding value of $\theta$, namely $\theta_m$, that satisfies
eq. (8.9). The difficulty is that the dependence of $\phi(\theta)$ on $\theta$ (eqs 8.5a and 8.6a) is such that
we cannot obtain an explicit expression for $\theta$ in terms of $m$. Equation (8.9) may, however,
be solved either graphically or numerically and it is useful to consider a graphical solution
here. Figure 8.8 shows graphs of the equation

$$y = m\pi + \phi(\theta) \qquad (8.10)$$

for several different values of $m$. Also plotted is the curve

$$y = \frac{2\pi n_1 d\cos\theta}{\lambda_0} \qquad (8.11)$$

Evidently the intersection of the two curves given by eqs (8.10) and (8.11) results in a value
for $\theta$ which will solve eq. (8.9). Note, however, that we are only interested in solutions where
the value of $\theta$ is greater than the critical angle $\theta_c$, otherwise the ray will not be totally
internally reflected at the interface. Thus for each value of $m$ there is at most one value of
$\theta$ (i.e. $\theta_m$) which will solve eq. (8.9).
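
As an alternative to the graphical construction, eq. (8.9) can be solved numerically. The sketch below does this for TE modes by bisection. It is an illustration only: it assumes that the TE phase change of eq. (8.5a), which is not reproduced in this section, has the standard Fresnel form $\tan\psi = (\sin^2\theta - n_2^2/n_1^2)^{1/2}/\cos\theta$ with $\phi = 2\psi$, and the function names and the 5 µm guide used in the demonstration are hypothetical choices.

```python
import math


def te_phase_shift(theta, n1, n2):
    """phi(theta) = 2*psi for TE light (assumed standard Fresnel form of eq. 8.5a)."""
    root = math.sqrt(max(math.sin(theta) ** 2 - (n2 / n1) ** 2, 0.0))
    return 2.0 * math.atan2(root, math.cos(theta))


def te_mode_angles(n1, n2, d, lam0, tol=1e-12):
    """Return the internal ray angles theta_m (radians) that solve eq. (8.9).

    Only solutions with theta_m >= theta_c are kept, so the list length is the
    number of guided TE modes.
    """
    theta_c = math.asin(n2 / n1)

    def mismatch(theta, m):
        # Left-hand side minus right-hand side of eq. (8.9); decreases with theta
        return (2.0 * math.pi * n1 * d * math.cos(theta) / lam0
                - te_phase_shift(theta, n1, n2) - m * math.pi)

    angles = []
    m = 0
    # A root exists in [theta_c, 90 deg) only while the mismatch at theta_c is >= 0
    while mismatch(theta_c, m) >= 0.0:
        lo, hi = theta_c, math.pi / 2.0
        while hi - lo > tol:
            mid = 0.5 * (lo + hi)
            if mismatch(mid, m) > 0.0:
                lo = mid
            else:
                hi = mid
        angles.append(0.5 * (lo + hi))
        m += 1
    return angles


if __name__ == "__main__":
    # Hypothetical guide: 5 um thick core, 1 um light, n1 = 1.48, n2 = 1.46
    for m, theta in enumerate(te_mode_angles(1.48, 1.46, 5e-6, 1e-6)):
        print(m, math.degrees(theta))
```

Because the left-hand side of eq. (8.9) falls monotonically as $\theta$ increases from $\theta_c$ to $90^\circ$, each $m$ yields at most one root, which is why simple bisection is sufficient here.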
FIG. 8.8 Illustration of a graphical solution to eq. (8.9). For the particular waveguide dimensions and light
wavelength used here there are only two possible ray angles which satisfy both the phase requirements of
eq. (8.9) and the condition for total internal reflection: these are designated by $\theta_0$ and $\theta_1$.



Each value of $\theta_m$ ($> \theta_c$) is, as we shall see later, associated with a distinct distribution of
electric field across the guide. Such a distribution is referred to as a mode. (We have met the
idea of modes previously when dealing with lasers in Chapter 5. Indeed the present discussion
is directly relevant to the possible modes in a semiconductor laser.) The characteristics of
the mode depend not only on the value of $m$ but also on whether the phase shifts on reflection
result from the $\mathcal{E}_\parallel$ or $\mathcal{E}_\perp$ situations. Since these two polarizations are also referred to
by the symbols TM and TE, the resulting modes are known as TE$_m$ and TM$_m$ modes. Thus
a TM$_m$ mode involves the field components $\mathcal{H}_x$, $\mathcal{E}_y$ and $\mathcal{E}_z$, whilst a TE$_m$ mode involves the
field components $\mathcal{E}_x$, $\mathcal{H}_y$ and $\mathcal{H}_z$ (Fig. 8.9).

If a particular mode has a value of $\theta_m$ where $\theta_m = \theta_c$, we say that the mode is at cut-off.
When $\theta_m < \theta_c$ the mode is below cut-off (and in consequence will be rapidly attenuated and hence
not propagate for any appreciable distance), whilst if $\theta_m > \theta_c$ the mode is above cut-off. It is
evident from Fig. 8.8 that both TE$_m$ and TM$_m$ modes will be at cut-off if



$$\frac{2\pi d n_1\cos\theta_c}{\lambda_0} = m\pi \qquad (8.12)$$

Since $\cos\theta_c = (1 - \sin^2\theta_c)^{1/2} = [1 - (n_2/n_1)^2]^{1/2}$ we will have mode cut-off when

$$\frac{2\pi d}{\lambda_0}\left(n_1^2 - n_2^2\right)^{1/2} = m\pi$$

or

$$V = \frac{\pi}{2}\,m \qquad (8.13)$$

where

$$V = \frac{\pi d}{\lambda_0}\left(n_1^2 - n_2^2\right)^{1/2} \qquad (8.14)$$

FIG. 8.9 A ray with $\mathcal{E}$ in the plane of incidence (a) gives rise to the three field components $\mathcal{H}_x$, $\mathcal{E}_y$ and $\mathcal{E}_z$
corresponding to a TM mode. A ray with $\mathcal{H}$ in the plane of incidence (b) gives rise to $\mathcal{E}_x$, $\mathcal{H}_y$ and $\mathcal{H}_z$ and
corresponds to a TE mode.



The parameter $V$ is referred to in a variety of ways: simply as the $V$ parameter, the
normalized film thickness, or the normalized frequency.

It is evident then that the number of guided TE or TM modes in a guide, $N$, is given by

$$N = 1 + \mathrm{INT}(2V/\pi) \qquad (8.15)$$

where INT indicates that the integer part of the following expression is taken.



EXAMPLE 8.2 Number of modes in a planar dielectric guide

We consider a planar dielectric waveguide 100 µm thick where $n_1 = 1.48$, $n_2 = 1.46$ and
where light of wavelength 1 µm is being used. For this guide

$$V = \frac{\pi \times 100 \times 10^{-6}}{1 \times 10^{-6}}\left(1.48^2 - 1.46^2\right)^{1/2} = 76.2$$

Thus $2V/\pi = 48.5$. Equation (8.15) then gives the total number of possible modes (both
TE and TM) as $2 \times [1 + \mathrm{INT}(48.5)] = 2 \times (1 + 48) = 98$.



An interesting result that follows directly from eq. (8.15) is that only one mode (i.e. the
$m = 0$ mode) will propagate if

$$V < \pi/2 \qquad (8.16)$$

Such a guide is called a single mode guide. This result implies that the $m = 0$ mode has no
cut-off, so that, in theory, whatever the dimensions of the guide the lowest order mode will
always propagate. (This is not the case if the guide is asymmetric; that is, it has media of
differing refractive indices above and below the central layer, see section 9.4.1.)
EXAMPLE 8.3 Single mode guide dimensions

From eq. (8.16) the condition for single mode behaviour can be written as

$$\frac{d}{\lambda_0} < \frac{1}{2\left(n_1^2 - n_2^2\right)^{1/2}}$$

Again taking a guide where $n_1 = 1.48$ and $n_2 = 1.46$ and using a wavelength of 1 µm,
the guide has to have a thickness $d$ such that

$$d < \frac{1}{2\left(1.48^2 - 1.46^2\right)^{1/2}}\ \text{µm}$$

That is, the waveguide core thickness must be less than 2.06 µm.
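
The arithmetic of Examples 8.2 and 8.3 follows directly from eqs (8.14) to (8.16) and can be checked with a few lines of Python. This is a sketch for illustration; the function names are not from the original text.

```python
import math


def planar_v(d, lam0, n1, n2):
    """V parameter of a symmetric planar guide, eq. (8.14)."""
    return math.pi * d / lam0 * math.sqrt(n1 ** 2 - n2 ** 2)


def planar_mode_count(d, lam0, n1, n2):
    """Number of guided TE (or TM) modes, eq. (8.15): N = 1 + INT(2V/pi)."""
    return 1 + int(2.0 * planar_v(d, lam0, n1, n2) / math.pi)


def single_mode_max_thickness(lam0, n1, n2):
    """Largest core thickness giving V < pi/2, from eq. (8.16)."""
    return lam0 / (2.0 * math.sqrt(n1 ** 2 - n2 ** 2))


if __name__ == "__main__":
    n1, n2, lam0 = 1.48, 1.46, 1e-6
    print("V =", planar_v(100e-6, lam0, n1, n2))
    print("TE + TM modes =", 2 * planar_mode_count(100e-6, lam0, n1, n2))
    print("single mode if d <", single_mode_max_thickness(lam0, n1, n2) * 1e6, "um")
```

All lengths are in metres, so the single mode limit is converted to micrometres only for printing.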



As we have seen, a given mode can be regarded as being made up from an 'infinite'
collection of rays travelling down the guide with the same value of internal angle $\theta$. If we
consider any particular point within the guide, only two of these rays can pass through it. One
will be directed 'upwards', the other directed 'downwards'. Since, in general, there will be 
a phase difference between these two, they will interfere and thereby give rise to a varia- 
tion in the field amplitude across the guide. Each mode is characterized by a different field 
variation. 

This phase difference, as a function of position within the guide, may be determined
from Fig. 8.10. This shows two rays meeting at a point C, a distance $y$ above the centre
of the guide. The line AC represents a wavefront, and hence the phases at the points A
and C must be the same. Thus the phase difference between the two rays meeting at C,
$\Delta\phi(y)$, arises from a path difference of AB + BC together with a phase change of $\phi(\theta)$ on
reflection at B.

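We have that $\mathrm{AB} = \mathrm{BC}\cos 2\theta$, so that $\mathrm{AB} + \mathrm{BC} = 2\,\mathrm{BC}\cos^2\theta$ (as shown above). Since we
also have $\mathrm{BC} = [(d/2) - y]/\cos\theta$, it follows that $\mathrm{AB} + \mathrm{BC} = 2[(d/2) - y]\cos\theta$. Thus we may
write

$$\Delta\phi(y) = \left(\frac{d}{2} - y\right)\frac{2 \times 2\pi n_1\cos\theta_m}{\lambda_0} - \phi(\theta)$$

Substituting for $\cos\theta_m$ from eq. (8.9) we obtain

$$\Delta\phi(y) = \left(\frac{d}{2} - y\right)\frac{2 \times 2\pi n_1}{\lambda_0}\,\frac{[m\pi + \phi(\theta)]\lambda_0}{2\pi d n_1} - \phi(\theta)$$

or

$$\Delta\phi(y) = m\pi - \frac{2y}{d}\,[m\pi + \phi(\theta)] \qquad (8.17)$$

FIG. 8.10 Two 'oppositely' directed rays meeting at a point C a distance $y$ above the guide centre.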

The resultant of two waves with a phase difference $\Delta\phi(y)$ may be written as

$$\mathcal{E}_0\{\cos(\omega t) + \cos[\omega t + \Delta\phi(y)]\} = 2\mathcal{E}_0\cos\left(\omega t + \frac{\Delta\phi(y)}{2}\right)\cos\left(\frac{\Delta\phi(y)}{2}\right)$$

The effective amplitude of the electric field is thus given by $2\mathcal{E}_0\cos[\Delta\phi(y)/2]$ which from
eq. (8.17) can be written as

$$2\mathcal{E}_0\cos\left\{\frac{m\pi}{2} - \frac{y}{d}\,[m\pi + \phi(\theta)]\right\} \qquad (8.18)$$

At the centre of the guide ($y = 0$) this will have the value $\pm 2\mathcal{E}_0$ or zero, depending on whether
$m$ is even or odd. At the guide edges ($y = d/2$) the amplitude is $2\mathcal{E}_0\cos[\phi(\theta)/2]$. The complete
electric field pattern is obtained by matching this cosinusoidal variation in the core with the
exponential decline in the cladding as given by eq. (8.8). Figure 8.11 shows the variation
of electric field amplitude across both the core and the cladding for the first four TE modes
in a planar dielectric waveguide. In fact for the particular waveguide parameters chosen, these
are the only TE modes allowed to propagate.
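
The core field variation of eq. (8.18) can be tabulated once the phase change $\phi(\theta_m)$ for the mode is known. The short Python sketch below is illustrative only: the function name is hypothetical and the value of `phi` must be supplied by the user, for example from a numerical solution of eq. (8.9).

```python
import math


def core_amplitude(y, d, m, phi, e0=1.0):
    """Effective field amplitude across the core from eqs (8.17) and (8.18).

    y   : distance above the guide centre (|y| <= d/2)
    d   : core thickness
    m   : mode number
    phi : phase change on reflection for this mode, in radians
    e0  : amplitude of each of the two interfering rays
    """
    delta_phi = m * math.pi - (2.0 * y / d) * (m * math.pi + phi)  # eq. (8.17)
    return 2.0 * e0 * math.cos(0.5 * delta_phi)                    # eq. (8.18)


if __name__ == "__main__":
    d, phi = 1.0, 0.8  # illustrative values only
    for m in (0, 1, 2):
        samples = [round(core_amplitude(k * d / 8.0, d, m, phi), 3) for k in range(5)]
        print(m, samples)
```

At $y = 0$ the function returns $\pm 2\mathcal{E}_0$ for even $m$ and zero for odd $m$, in line with the behaviour described above.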
Figure 8.12 illustrates the mode field of a TE mode both close to and far from cut-off.
It is seen that a large proportion of the mode field extends into the cladding near cut-off.
Indeed, cut-off marks the point when all the mode energy resides in the cladding. This
behaviour is readily understood from an inspection of Fig. 8.8 since, as $V$ decreases and
cut-off is approached, $\theta_m$ will get closer to $\theta_c$ and hence both $\psi$ and $\delta$ (and hence $\phi(\theta)$) will
tend towards zero. This in turn will cause the field amplitude at the cladding boundary to
increase. In addition we have already noted that the rapidity of decay of the evanescent field
decreases as the guide angle $\theta$ approaches $\theta_c$ (Example 8.1). A similar trend can be observed
in Fig. 8.11: as the mode number increases the proportion of the mode field that is in the
cladding increases. This again may be understood by noting that as $m$ increases $\theta_m$ will get
closer to $\theta_c$.
FIG. 8.12 The mode field of a TE mode when it is close to cut-off and far from cut-off. When close to cut-off the
mode field spreads out an appreciable distance into the cladding.



8.3 Optical fiber waveguides

In their simplest form optical fibers have a cylindrical geometry with a core region surrounded
by a cladding (Fig. 8.1). As in the planar waveguide the refractive index of the core is greater
than that of the cladding. Such fibers are referred to as step index fibers because of the shape
of the refractive index profile (Fig. 8.13). The resulting structure is often covered by an
additional coating layer (usually of plastic) which serves to strengthen the fiber and to provide
protection against chemical attack. As with the planar waveguide, radiation is restricted to
travel down the guide in certain 'modes', with each mode corresponding to a distinct
electromagnetic field distribution across the guide. The equations describing the modes in
circular waveguides are much more complicated than their planar waveguide counterparts, and
we make no attempt to derive them here (see e.g. ref. 8.5). However, we can take a number
of the concepts developed for planar waveguides and apply them, with some modification,
to circular waveguides.

FIG. 8.13 Refractive index profile of the step index fiber.

We may divide the rays travelling within the fiber into two types, meridional rays and
skew rays as illustrated in Figs 8.14(a) and 8.14(b) respectively. The former pass through
the central axis of the guide and are in many ways similar to the rays in the planar waveguide.
As in the planar waveguide they give rise to modes which are designated transverse
electric (TE) and transverse magnetic (TM). In this case, however, two integers $l$ and $m$
are required to specify completely the mode rather than just one ($m$) as before (essentially
because of the two-dimensional nature of the guide). We thus refer to TE$_{lm}$ and TM$_{lm}$
modes.

Skew rays have no analog in planar waveguides and describe angular 'helices' within the
guide. Because of the angles involved, components of both $\mathcal{E}$ and $\mathcal{H}$ can be transverse to
the fiber axis. Consequently the modes originating from skew rays are designated as either
HE$_{lm}$ or EH$_{lm}$ depending on whether their magnetic or electric characters are more important.
It is also possible to launch some skew rays which, strictly speaking, do not correspond
to bound modes, but which nevertheless can propagate for appreciable distances before being
lost from the fiber. The resulting modes are called leaky modes.

In most practical waveguides, the refractive indices of the core and cladding differ from
each other by only a few per cent and it may be shown (ref. 8.6) that the full set of modes
(i.e. TE$_{lm}$, TM$_{lm}$, HE$_{lm}$ and EH$_{lm}$) can be approximated by a set of so-called linearly polarized
(LP$_{lm}$) modes. The electric field intensity profiles of three such modes are illustrated
in Fig. 8.15. An LP$_{lm}$ mode in general has $m$ field maxima along a radius vector and $2l$ field
maxima round a circumference. On a ray picture, $l$ is a measure of the degree of helical
propagation: the larger its value the tighter the helix. The integer $m$, on the other hand, is related
to the angle $\theta$: the larger $m$ the smaller the value of $\theta$ involved.

As in planar waveguides (see eq. 8.14) the number of modes in a step index circular
waveguide is determined by a $V$ parameter, where now

$$V = \frac{2\pi a}{\lambda_0}\left(n_1^2 - n_2^2\right)^{1/2} \qquad (8.19)$$

and $a$ is the core radius. Such a fiber can support only one mode when $V < 2.405$ whilst
when $V \gg 1$ the number of modes, $N$, that can propagate is given by

$$N \approx V^2/2 \qquad (8.20)$$

FIG. 8.14 Ray paths in step index fibers: (a) meridional rays; (b) skew rays.

FIG. 8.15 Electric field distributions in three circular waveguide modes.



Apart from very special circumstances fibers are either multimode, where the fiber core diameter
is such that the number of modes is very large (e.g. see Example 8.4), or single mode
when only one mode can propagate. In multimode fibers the modes are so numerous that
they effectively involve a continuum of ray angles within the guide. This enables an analysis
of some of the guide properties to be carried out using ray theory.



EXAMPLE 8.4 Number of modes in a fiber

We consider a fiber where $n_1 = 1.48$ and $n_2 = 1.46$ and $\lambda_0 = 900$ nm. If the core radius
is 100 µm, then the value of $V$ is given by

$$V = \frac{2\pi \times 100 \times 10^{-6}\,(1.48^2 - 1.46^2)^{1/2}}{0.9 \times 10^{-6}} = 169.3$$

Since $V \gg 1$ the number of modes able to propagate is given, approximately, by
$(169.3)^2/2 \approx 14\,329$.
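
The fiber $V$ parameter of eq. (8.19) and the mode estimate of eq. (8.20) can be evaluated in the same way as for the planar guide. The sketch below is illustrative; the function names are not from the original text, and the mode estimate is only meaningful when $V \gg 1$.

```python
import math


def fiber_v(a, lam0, n1, n2):
    """V parameter of a step index fiber of core radius a, eq. (8.19)."""
    return 2.0 * math.pi * a / lam0 * math.sqrt(n1 ** 2 - n2 ** 2)


def fiber_mode_estimate(v):
    """Approximate number of guided modes for V >> 1, eq. (8.20)."""
    return v ** 2 / 2.0


def is_single_mode(v):
    """True when only the lowest order mode can propagate (V < 2.405)."""
    return v < 2.405


if __name__ == "__main__":
    v = fiber_v(100e-6, 0.9e-6, 1.48, 1.46)  # values of Example 8.4
    print(v, fiber_mode_estimate(v), is_single_mode(v))
```

Rearranging `fiber_v` for the radius that gives $V = 2.405$ yields the largest single mode core for a given wavelength and index pair.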
8.3.1 Step index multimode fibers

We assume in our analysis that we need to deal only with meridional rays. This approximation
owes more than a little to the fact that skew rays involve some rather unpleasant geometry!
However, in practice the approximation yields reasonably acceptable results. The path
of a meridional ray entering a fiber and undergoing total internal reflection is illustrated in
Fig. 8.16. The angle $\alpha$ that the ray in the external medium (usually air) makes with the normal
to the end of the fiber is related to the internal angle $\theta$ by Snell's law, so that

$$\frac{\sin\alpha}{\sin(90^\circ - \theta)} = \frac{n_1}{n_0}$$

Hence

$$\sin\alpha = \frac{n_1}{n_0}\cos\theta$$

The maximum value that $\alpha$ can take, $\alpha_{\max}$, is thus determined by the minimum value that $\theta$
can take, which is of course the critical angle $\theta_c$. Thus we have

$$n_0\sin\alpha_{\max} = n_1\left(1 - \sin^2\theta_c\right)^{1/2} = \left(n_1^2 - n_2^2\right)^{1/2}$$

The quantity $(n_1^2 - n_2^2)^{1/2}$ is known as the numerical aperture (NA) of the fiber, and hence

$$\alpha_{\max} = \sin^{-1}(\mathrm{NA}/n_0) \qquad (8.21)$$

$\alpha_{\max}$ is known as the fiber acceptance angle. Another parameter that is used in connection
with step index fibers is $\Delta$, where

$$\Delta = \left(n_1^2 - n_2^2\right)/2n_1^2 \qquad (8.22)$$




FIG. 8.16 Illustration of the path of a meridional ray (solid line) as it enters a circular step index waveguide
from a medium of refractive index $n_0$. The ray is incident on the end of the fiber at an angle $\alpha$ to the
normal. Inside the waveguide, the ray makes an angle $\theta$ with the normal to the guide axis. Also shown
(chain line) is the path of a ray that does not undergo total internal reflection at the core/cladding interface
and does not, therefore, correspond to a bound mode.
Since $n_1$ and $n_2$ usually only differ by a few per cent, we may write

$$\Delta \approx (n_1 - n_2)/n_1 \quad (n_1 \approx n_2) \qquad (8.22a)$$

The numerical aperture can be expressed in terms of $\Delta$ by

$$\mathrm{NA} = n_1(2\Delta)^{1/2} \qquad (8.23)$$



It should be noted that skew rays can have larger launch angles than meridional rays, so that,
strictly speaking, the maximum acceptance angle of a fiber will be somewhat larger than
that given by eq. (8.21). For most practical purposes this extra complication is usually
ignored.

Figure 8.16 also shows a ray which is incident on the fiber end at an angle greater than
$\alpha_{\max}$. This ray is refracted into the cladding and does not correspond to a bound mode. It
may, however, undergo total internal reflection at the cladding/coating (or cladding/air) interface
and thus be able to propagate down the fiber to some extent, although the attenuation
is expected to be high. This is an example of a cladding mode and, although lossy, such modes
may give rise to misleading results when short lengths of fiber are used to determine such
parameters as fiber attenuation and numerical aperture (see section 8.6).

EXAMPLE 8.5 Calculation of typical fiber parameters

We consider a multimode fiber in air with $n_1 = 1.48$, $n_2 = 1.46$ and $n_0 = 1$. The smallest
value of the internal angle is $\theta_c$, which is given by $\sin\theta_c = 1.46/1.48$, whence

$$\theta_c = 80.57^\circ$$

From eq. (8.22) the parameter $\Delta$ is given by

$$\Delta = (1.48^2 - 1.46^2)/(2 \times 1.48^2) = 0.013\,42$$

If the approximate expression, eq. (8.22a), is used instead we have

$$\Delta \approx (1.48 - 1.46)/1.48 = 0.013\,51$$

We also have

$$\mathrm{NA} = (1.48^2 - 1.46^2)^{1/2} = 0.242$$

The value of $\alpha_{\max}$ is then given by eq. (8.21) to be

$$\alpha_{\max} = \sin^{-1}(0.242) = 14.0^\circ$$
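
The quantities defined in eqs (8.21) to (8.23) depend only on the three refractive indices, so they are convenient to compute together. The following Python sketch is an illustration; the function name and the dictionary keys are arbitrary choices, and it assumes NA does not exceed the index of the external medium so that the inverse sine is defined.

```python
import math


def fiber_parameters(n1, n2, n0=1.0):
    """Critical angle, Delta, numerical aperture and acceptance angle of a step index fiber."""
    na = math.sqrt(n1 ** 2 - n2 ** 2)                          # numerical aperture
    return {
        "theta_c_deg": math.degrees(math.asin(n2 / n1)),       # smallest internal ray angle
        "delta": (n1 ** 2 - n2 ** 2) / (2.0 * n1 ** 2),        # eq. (8.22)
        "delta_approx": (n1 - n2) / n1,                        # eq. (8.22a)
        "NA": na,
        "alpha_max_deg": math.degrees(math.asin(na / n0)),     # eq. (8.21)
    }


if __name__ == "__main__":
    for name, value in fiber_parameters(1.48, 1.46).items():
        print(f"{name}: {value:.5f}")
```

The relation NA $= n_1(2\Delta)^{1/2}$ of eq. (8.23) holds exactly for the value returned under `delta`, and only approximately for `delta_approx`.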




8.3.2 Intermodal dispersion



We know that on a simple ray model the rays corresponding to TE and TM modes are meridional
rays which follow a simple zigzag path down the fiber core, very similar in fact to the
ray path in the planar waveguide (Fig. 8.6). It is evident from this diagram that the speed
with which the ray travels down the waveguide depends on its internal angle. Thus along
the ray path the velocity will have a magnitude of $c/n_1$, whilst the component of this velocity
along the fiber axis is $(c/n_1)\cos(90^\circ - \theta)$, or $(c/n_1)\sin\theta$. The highest component velocity
will be given when $\theta = 90^\circ$ whilst the smallest will occur when $\theta = \theta_c$. If we consider a
length $L$ of fiber, then the ray with the highest component velocity will take a time $\Delta\tau_{\min}$ to
traverse the fiber where

$$\Delta\tau_{\min} = \frac{Ln_1}{c\sin 90^\circ} = \frac{Ln_1}{c}$$

Similarly the ray with the lowest component velocity will take a time $\Delta\tau_{\max}$ where

$$\Delta\tau_{\max} = \frac{Ln_1}{c\sin\theta_c} = \frac{Ln_1^2}{cn_2}$$

The difference between these two may be written

$$\Delta\tau_{SI} = \Delta\tau_{\max} - \Delta\tau_{\min} = \frac{Ln_1}{cn_2}(n_1 - n_2) \qquad (8.24)$$

Thus if a pulse with an extremely narrow temporal width were to be sent down the fiber we
would expect it to emerge from the other end with a maximum pulse width of $\Delta\tau_{SI}$ where
$\Delta\tau_{SI}$ is given by eq. (8.24). Since we associate different ray angles with different modes, an
alternative way of discussing the broadening is to say that different modes travel down the
fiber with different velocities. This helps to explain the name given to the phenomenon of
intermodal dispersion.
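
The ray transit times behind eq. (8.24) are simple to evaluate. The sketch below is illustrative Python with hypothetical function names; it takes $c = 3 \times 10^8$ m s$^{-1}$ as a rounded default and uses SI units throughout.

```python
def axial_transit_times(length, n1, n2, c=3.0e8):
    """Shortest and longest ray transit times along a step index fiber.

    The fastest ray travels along the axis (theta = 90 deg); the slowest
    travels at the critical angle, for which sin(theta_c) = n2 / n1.
    """
    t_min = length * n1 / c
    t_max = length * n1 ** 2 / (c * n2)
    return t_min, t_max


def step_index_broadening(length, n1, n2, c=3.0e8):
    """Maximum intermodal pulse broadening, equivalent to eq. (8.24)."""
    t_min, t_max = axial_transit_times(length, n1, n2, c)
    return t_max - t_min


if __name__ == "__main__":
    # 1 km of fiber with n1 = 1.48 and n2 = 1.46
    print(step_index_broadening(1.0e3, 1.48, 1.46), "s")
```

Because the broadening scales linearly with `length`, it is often quoted per kilometre of fiber.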

It is worthwhile pursuing the mode velocity concept a little further. If distance along the
fiber is measured by $z$ then the mode fields travel down the fiber with a $z$ and $t$ dependence
which can be written as $\exp[\mathrm{i}(\omega t - \beta z)]$, where $\beta$ is the mode propagation constant (for plane
waves the corresponding parameter is usually written as $k$, see eq. 1.6, and is known as the
wavenumber). The mode phase velocity is then $\omega/\beta$, but the velocity with which energy can
travel down the fiber is given by the group velocity $\mathrm{d}\omega/\mathrm{d}\beta$ (see eq. 1.8). A schematic variation
of $\omega$ with $\beta$ for a particular TE or TM mode is shown in Fig. 8.17(a). At large values
of $\omega$ (or $\beta$) far from mode cut-off, the group velocity tends towards the value $c/n_1$. As we
move down the curve towards cut-off, the group velocity slowly decreases at first, to be followed
by a final rapid increase to the value $c/n_2$ as cut-off is approached (Fig. 8.17b).

We can gain a qualitative understanding of most of this behaviour by using the simple
ray picture. Far from cut-off, a mode has an internal guide angle that approaches $90^\circ$, and
hence its effective velocity down the guide should indeed approach $c/n_1$. As we move towards
cut-off, the ray angle decreases and, because of the increased path lengths involved, the effective
velocity down the guide decreases. However, the final rapid increase in velocity for the
lowest possible $\beta$ values cannot be explained using this simple picture. The discrepancy arises
essentially from our neglect of the phase change on reflection. This may be incorporated
into the ray model by assuming that the ray penetrates some way into the cladding at each
'reflection' as shown in Fig. 8.18. Close to cut-off the ray spends an increasing amount of
time in the cladding where it travels faster than it would in the core, thus giving rise to an
increase in the overall velocity down the guide. It should be mentioned that some EH and
HE modes, those corresponding to highly 'skewed' rays, reach cut-off before an appreciable
fraction of the modal energy extends into the cladding.

FIG. 8.17 Schematic relationship (a) between $\omega$ and $\beta$, and (b) between $\mathrm{d}\omega/\mathrm{d}\beta$ and $\beta$ for a particular
mode.

FIG. 8.18 Ray path within a guide illustrating the origin of the phase change on reflection. Within the
guide the ray behaves as if it were reflected a short distance inside the cladding.

The values of the mode velocities at and far from cut-off also agree with what might be
expected in view of the behaviour of the mode field distributions as shown in Fig. 8.12 for
the planar waveguide. Far from cut-off, modes have most of their energy contained within
the core and hence it is reasonable that they should have a velocity corresponding to
that of a plane wave travelling in a medium of refractive index $n_1$. Modes close to cut-off,
on the other hand, have an appreciable fraction of their mode energy in the cladding and
hence might be expected to have velocities approaching $c/n_2$.

When many modes are present covering the whole range from close to cut-off to far from
cut-off, then we expect that the mode velocities will have a spread of roughly $(c/n_2 - c/n_1)$,
which is in fairly close agreement with the result of eq. (8.24) above.
EXAMPLE 8.6

Taking a fiber of length 1 km with $n_1 = 1.48$ and $n_2 = 1.46$, then the maximum pulse
broadening due to intermodal dispersion as given by eq. (8.24) is

$$\Delta\tau_{SI} = \frac{1 \times 10^3 \times 1.48}{3 \times 10^8 \times 1.46}(1.48 - 1.46) = 6.76 \times 10^{-8}\ \mathrm{s}$$
Despite the presence of mode coupling, the effects of intermodal dispersion still impose
rather drastic limitations on the rate that information can be transmitted down the fiber. This 
will be discussed in more detail in Chapter 9; however, it is easy to appreciate at this stage 
that any narrow peaks in the optical irradiance will tend to broaden out in time as a func- 
tion of distance along the fiber and hence reduce the information content (especially at high 
frequencies) of an amplitude-modulated signal. These unwanted consequences of intermodal 
dispersion may be alleviated either by designing a fiber where the mode velocities are more 
nearly equal than in step index fiber, or by using a fiber where only one mode is allowed to 
propagate (i.e. single mode fiber). As we shall see, single mode fibers have rather small diameter
cores and it was not evident at first whether they could be manufactured reliably or indeed
whether such problems as launching light into such fibers and joining them together could 
be readily solved. Thus it was that the first low dispersion fibers were of the so-called graded 
index type. 



8.3.3 Graded index fibers 

In a fiber where the refractive index in the core region is not constant but rather decreases
smoothly from the centre to the cladding, the modes tend to travel with mode velocities which
are closer together than in the case of step index fiber. The variation in refractive index with
radial distance, $r$, in such fibers is often expressed in the form

$$n(r) = n_1\left[1 - 2\Delta(r/a)^\alpha\right]^{1/2} \qquad r < a$$
$$n(r) = n_1(1 - 2\Delta)^{1/2} = n_2 \qquad r \geq a \qquad (8.25)$$

where $a$ is the core radius and $\Delta$ is defined in eq. (8.22). The parameter $\alpha$ determines the
core refractive index variation (care must be taken not to confuse it with the absorption
coefficient). Figure 8.21 shows the refractive index profile of a fiber where $\alpha = 2$.
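
The profile of eq. (8.25) can be written as a small function, which makes it easy to tabulate or plot for different values of the profile parameter. This is an illustrative Python sketch; the function name and the 25 µm radius used in the demonstration are assumptions, and $\Delta$ is taken from eq. (8.22).

```python
import math


def graded_index(r, a, n1, n2, alpha):
    """Refractive index at radius r (r >= 0) for the profile of eq. (8.25).

    a     : core radius
    n1    : index on the fiber axis
    n2    : cladding index
    alpha : profile parameter (alpha = 2 gives a parabolic profile)
    """
    delta = (n1 ** 2 - n2 ** 2) / (2.0 * n1 ** 2)   # eq. (8.22)
    if r < a:
        return n1 * math.sqrt(1.0 - 2.0 * delta * (r / a) ** alpha)
    return n1 * math.sqrt(1.0 - 2.0 * delta)        # equals n2 in the cladding


if __name__ == "__main__":
    a = 25e-6  # hypothetical core radius
    for frac in (0.0, 0.25, 0.5, 0.75, 1.0, 1.25):
        print(frac, round(graded_index(frac * a, a, 1.48, 1.46, 2.0), 5))
```

Very large values of `alpha` make the returned index stay close to `n1` over almost the whole core, which is the step index limit.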

The guided modes in such fibers are generally similar to those in step index fibers, and
may also be referred to as LP$_{lm}$ modes. In fact the step index profile can be seen to be a
particular case of the above profile with $\alpha = \infty$. The number of guided modes, $N$, is given
by (ref. 8.7)

$$N = \frac{\alpha}{\alpha + 2}\,\frac{V^2}{2}$$

As expected this reduces to the step index result (eq. 8.20) when $\alpha = \infty$.

FIG. 8.21 Refractive index profile of a graded index fiber.

It may be shown (ref. 8.8) that intermodal dispersion is at a minimum when $\alpha$ is close to
the value 2 and that with a monochromatic source the resulting minimum pulse broadening,
$\Delta\tau_{GI}^{\min}$, over a distance $L$ is given, approximately, by

$$\Delta\tau_{GI}^{\min} \approx \frac{Ln_1}{8c}\,\Delta^2 \qquad (8.26)$$

Since we usually have $n_1 \approx n_2$, the result for the step index fiber dispersion (eq. 8.24) can
be modified to

$$\Delta\tau_{SI} = \frac{Ln_1}{cn_2}(n_1 - n_2) \approx \frac{Ln_1}{c}\,\Delta$$

and so

$$\frac{\Delta\tau_{GI}^{\min}}{\Delta\tau_{SI}} \approx \frac{\Delta}{8}$$

Thus the dispersion in optimally graded index fibers should be much smaller than in step
index fibers (see Example 8.7). It is interesting to note that when $\alpha = 2$, the number of guided
modes that the guide can support is half that for step index fiber of the same core radius.
This implies that the amount of energy that can be coupled into graded index fiber is only
about half of that which can be coupled into a comparable step index fiber.

EXAMPLE 8.7 Intermodal dispersion in graded index fibers

Taking an optimally graded fiber of length 1 km with the usual core and cladding
refractive indices (i.e. $n_1 = 1.48$, $n_2 = 1.46$), we have that

$$\Delta = \frac{1.48^2 - 1.46^2}{2 \times 1.48^2} = 0.0134$$

From eq. (8.26) we have

$$\Delta\tau \approx \frac{1 \times 10^3 \times 1.48}{8 \times 3 \times 10^8}\,(0.0134)^2 = 1.11 \times 10^{-10}\ \mathrm{s}$$

As expected this is a much smaller value for the dispersion than is the case in step index
fibers (Example 8.6), where $\Delta\tau_{SI} = 6.76 \times 10^{-8}$ s.



Although in practice graded index fibers do indeed achieve much reduced values for the
intermodal dispersion it is very difficult to obtain values that approach the predictions of
eq. (8.26). The reason for this lies in the extreme sensitivity of mode dispersion to variations
in $\alpha$, as shown in Fig. 8.22. Any slight variations in $\alpha$ that are likely to occur in fiber
manufacture will easily increase the mode dispersion from its theoretical minimum value.
The situation is further complicated by the presence of other forms of dispersion that become 
evident when intermodal dispersion is eliminated and which will be dealt with in section 
8.3.6. 

Before turning to single mode fibers it is interesting to analyze why, on a simple ray picture,
graded index fibers have reduced intermodal dispersion. We consider a meridional ray which
is travelling with a particular angle, $\theta$, through the fiber centre. Instead of the fiber being
continuously graded from core to cladding we suppose that the changes occur in a series of
small steps. The ray will then undergo a series of refractions and, since the progression is
from larger to smaller refractive indices, the value of $\theta$ will increase. Eventually $\theta$ will be
large enough so that the ray, instead of being refracted, undergoes total internal reflection
and is then directed back downwards towards the fiber centre as shown in Fig. 8.23(a). If
we imagine that the refractive index steps increase in number whilst the change in refractive
index at each step becomes smaller, then the ray path will tend towards the smooth
sinusoid-like trajectory shown in Fig. 8.23(b). A more detailed analysis confirms this (ref. 8.9).
At the optimum grading profile (i.e. where $\alpha = 2$) different modes have very nearly the same



100 r 

Mode 
dispersion 

(nskrrT 1 ) 




5 10 20 

Profile parameter, n 



50 



FIG. 8.22 Intermodal dispersion in ,i pr.ick'd index fiber ,is a tunc lion ot the profile parameter a. The 
curve has a very sharp minimum at ,i value of a just less than two. 



■wraEwssp* 1 "^ 



OPTICAL FIBER WAVECU1DES ;j«. r i 



Total internal reflection 




Decreasing 

refractive 

index 



Cladding 




Core 



Cladding 



(b) 



F 1C. B.23 (a) The path of a ray through a series of thin slabs of decreasing refractive index, (hi If the thick- 
neSi of the slabs reduces to infinitesimally small values we obtain a sinusoid-iike ray path. 



•wavelength' along the guide, but different excursion 'amplitudes' away from the fiber axis 
(Fig- 8.24a). 

We can now appreciate why mode dispersion is reduced in graded index fibers. Although 
the higher order modes (i.e. those with larger 'amplitudes') have longer path lengths than 
do the lower order modes, when they are further away from the centre of the fiber they are 
travelling in regions of lower refractive index, and hence higher ray velocity. Thus the higher 
order modes can compensate to some extent for their longer paths by having a higher average 
velocity. 

Only meridional-type rays have been discussed so far, yet it should be mentioned that the 
equivalents of skew rays in graded index fibers also exist and, perhaps not very surprisingly, 
turn out to be 'helical' rays which avoid the centre (Fig. 8.24b). 

8.3.4 Single mode fibers

We have already mentioned that when the fiber V parameter is less than 2.405 then only
one mode (the $\mathrm{HE}_{11}$ mode, or, in the linearly polarized approximation, the
$\mathrm{LP}_{01}$ mode) can propagate. Actually, strictly speaking two $\mathrm{HE}_{11}$ modes
can be present with orthogonal polarizations, but for simplicity we will assume that we are
dealing with only one of these. In theory, the $\mathrm{HE}_{11}$ mode will propagate no matter
how small the value of V. As V decreases, however, the mode field will extend increasingly into
the cladding (see the discussion following eq. 8.18); if the field then becomes at all
significant at the edge of the cladding, appreciable amounts of energy may be lost from the
fiber, leading to the mode being highly attenuated.

FIG. 8.24 Ray paths in a graded index fiber for (a) meridional rays and (b) helical rays which
avoid the centre.

In terms of the fiber core radius, $a$, the condition (eq. 8.19) for a single mode to
propagate is

$$a < \frac{2.405\,\lambda_0}{2\pi (n_1^2 - n_2^2)^{1/2}}$$

or

$$a < \frac{2.405\,\lambda_0}{2\pi (\mathrm{NA})}$$        (8.27)



This relationship implies that single mode fibers will have cores that are only of the order
of $\lambda_0$ (i.e. micrometres) in radius. It is advantageous from a number of points of view,
however, that they have as large a diameter as possible. From eq. (8.27) we can see that
this may be done by reducing the NA value, that is by making the core and cladding refractive
indices very close together. In practical terms single mode fibers are made with NA
values of the order of 0.1, with a typical design criterion for a single mode fiber being
$2 < V < 2.2$. When used with radiation in the wavelength region 1.3 µm to 1.6 µm, single
mode fibers have core diameters that are typically between 5 µm and 10 µm, as illustrated
in Example 8.8.

EXAMPLE 8.8 Single mode fiber radius

If we take our usual fiber core and cladding refractive indices (i.e. $n_1 = 1.48$ and
$n_2 = 1.46$), we have a numerical aperture of 0.242. If radiation of 1.5 µm wavelength is used
then from eq. (8.27) the maximum core radius for single mode operation is given by

$$a < \frac{2.405 \times 1.5 \times 10^{-6}}{2\pi \times 0.242}$$

that is,

$$a < 2.37\ \mu\mathrm{m}$$

With an NA value of 0.1, the maximum value of $a$ is increased to 5.74 µm. With a
design criterion of [...] the core diameter [...].
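The numbers in Example 8.8 follow directly from eq. (8.27). A minimal Python sketch is given
below; the function names are hypothetical and chosen only for illustration, and the wavelength
is the free-space value.

```python
import math


def numerical_aperture(n1, n2):
    """NA = (n1^2 - n2^2)^(1/2) for core index n1 and cladding index n2."""
    return math.sqrt(n1 ** 2 - n2 ** 2)


def max_single_mode_radius(wavelength_m, na, v_cutoff=2.405):
    """Largest core radius allowed by eq. (8.27): a < 2.405 lambda0 / (2 pi NA)."""
    return v_cutoff * wavelength_m / (2.0 * math.pi * na)


if __name__ == "__main__":
    print("NA           =", numerical_aperture(1.48, 1.46))
    print("a_max (0.242) =", max_single_mode_radius(1.5e-6, 0.242), "m")
    print("a_max (0.1)   =", max_single_mode_radius(1.5e-6, 0.1), "m")
```

Lowering the NA from 0.242 to 0.1 more than doubles the permitted core radius, which is the
design trade-off described in the text.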



It should be noted that for a given core diameter a particular fiber will only be single mode
when the wavelength of the radiation being used is greater than a critical value, $\lambda_c$,
which is called the cut-off wavelength (since it represents the wavelength at which the mode
above the lowest order mode cuts off). From eq. (8.27) we have that

$$\lambda_c = 2\pi a (\mathrm{NA})/2.405$$        (8.28)

When a fiber is to be used as a single mode fiber, care must be taken to ensure that the
wavelengths used never fall below the cut-off wavelength.
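Equation (8.28) can be turned into a simple check on whether a given fiber is single mode at a
given wavelength. The sketch below is illustrative: the function names and the sample fiber
(4 µm core radius, NA of 0.1) are assumptions, not values taken from the text. Operation is
single mode only when the wavelength is greater than $\lambda_c$.

```python
import math


def cutoff_wavelength(core_radius_m, na, v_cutoff=2.405):
    """Cut-off wavelength from eq. (8.28): lambda_c = 2 pi a NA / 2.405."""
    return 2.0 * math.pi * core_radius_m * na / v_cutoff


def is_single_mode(wavelength_m, core_radius_m, na):
    """True when the wavelength lies above the cut-off wavelength."""
    return wavelength_m > cutoff_wavelength(core_radius_m, na)


if __name__ == "__main__":
    a, na = 4.0e-6, 0.1  # hypothetical fiber
    print("lambda_c =", cutoff_wavelength(a, na), "m")
    for lam in (0.85e-6, 1.3e-6, 1.55e-6):
        print(lam, "single mode" if is_single_mode(lam, a, na) else "multimode")
```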

Although the mode field distribution in a single mode fiber is theoretically described by
Bessel functions it is convenient to represent the field irradiance distribution (with little
loss in accuracy) by the much simpler Gaussian function, that is

$$I(r) = I_0 \exp(-2r^2/\omega_0^2)$$        (8.29)

where $2\omega_0$ is known as the mode field diameter. $\omega_0$ thus represents the radial
distance at which the mode irradiance has fallen to $\exp(-2)$ (i.e. 13.5%) of its peak value.
As the V parameter of a fiber gets smaller we would expect that $\omega_0$ will increase (i.e.
the field will extend further into the cladding). A relationship between $\omega_0$ and V which
is accurate to better than 1% if $1.2 < V < 3$ is given by (ref. 8.10)

$$\omega_0/a = 0.65 + 1.619\,V^{-3/2} + 2.879\,V^{-6}$$        (8.29a)
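Equations (8.29) and (8.29a) are straightforward to evaluate numerically. The following Python
sketch is a minimal illustration; the function names are hypothetical, and the range check
simply enforces the validity interval $1.2 < V < 3$ quoted above.

```python
import math


def mode_field_radius_ratio(v):
    """omega_0 / a from eq. (8.29a); quoted as valid for 1.2 < V < 3."""
    if not 1.2 < v < 3.0:
        raise ValueError("eq. (8.29a) is only quoted for 1.2 < V < 3")
    return 0.65 + 1.619 * v ** -1.5 + 2.879 * v ** -6


def gaussian_irradiance(r, w0, i0=1.0):
    """Gaussian approximation to the mode irradiance, eq. (8.29)."""
    return i0 * math.exp(-2.0 * r ** 2 / w0 ** 2)


if __name__ == "__main__":
    for v in (1.5, 2.0, 2.4):
        print("V =", v, " omega_0/a =", mode_field_radius_ratio(v))
    # Fraction of the peak irradiance remaining at r = omega_0
    print("I(omega_0)/I_0 =", gaussian_irradiance(1.0, 1.0))
```

The ratio grows as V falls, which is the numerical counterpart of the statement that the field
spreads further into the cladding at small V.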
EXAMPLE 8.9 Mode field diameters in single mode fiber

A typical design criterion for a single mode fiber is $V = 2$. Under these circumstances
the mode field irradiance diameter is given by eq. (8.29a) as

$$\omega_0/a = 0.65 + 1.619 \times (2)^{-3/2} + 2.879 \times (2)^{-6} = 1.267$$

Thus in this particular case the mode field radius is somewhat greater than the core radius.
The resulting mode irradiance distribution is shown in Fig. 8.25.

FIG. 8.25 The radial variation of the mode field in a single mode fiber. As the V parameter
decreases the mode field extends further and further into the cladding.



8.3.5 Fiber materials and types 

At this stage it is useful to mention briefly the types of fiber that are available, the
materials from which they are made and typical dimensions. For a fiber material to be of any
practical use it must obviously be highly transparent to the radiation being used. It must also
be possible to fabricate the basic core and cladding structures as discussed above and finally
it must be reasonably flexible so that it can be taken round bends etc. There are in fact only
two materials that satisfy these criteria and which currently are widely used in fiber
manufacture, these being silica (SiO$_2$) and various types of plastic.

Typical plastic fibers have core and cladding made from different types of plastic. They
have core diameters of about 980 µm and core + cladding diameters of about 1000 µm.
Maximum diameters are limited to a few millimetres, beyond which the fibers start to become
rather inflexible. The two different types of plastic used for the core and cladding can have
significantly different refractive indices, resulting in relatively large numerical apertures
(i.e. 0.4 or greater). The main problem with plastic fibers is that they exhibit much higher
attenuations than silica fibers. A more detailed discussion of plastic fibers is contained in
section 8.7.3.

Silica fibers by contrast can be made with attenuations as low as 0.2 dB km$^{-1}$ (see section
8.4.2). However, the fibers become increasingly inflexible when their diameters exceed
600 µm; typical multimode step index fibers have core diameters of 200 µm, with
core + cladding diameters of 250 µm. The necessary differences between the core and
cladding refractive indices are obtained by doping the silica. For example, doping silica
with germania (GeO$_2$) raises the refractive index, so that a silica-based fiber could be made
from a germania-doped core with a pure silica cladding. Since it is difficult to change the
refractive index of silica by more than a few per cent by doping, the fibers have rather smaller
numerical apertures [...]. Both graded index and single mode silica fibers are readily
available. The graded index fibers have rather smaller core diameters than step index fibers
(usually 50 µm or [...]). [...] previously, almost all silica fibers have a coating of plastic
material [...] preventing physical and chemical damage. The overall diameter [...] and 500 µm.



8.3.6 Dispersion in single mode fibers 

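By using single mode fiber the problems of intermodal dispersion are avoided. However, there
still remain other sources of dispersion, particularly material dispersion.

Material dispersion arises because of the variation with wavelength of the refractive index
of the fiber material, and applies to any signal consisting of a finite spread of wavelengths.
Such a signal travels at the group velocity $v_g$, where

$$v_g = \frac{d\omega}{dk}$$

Here $\omega = 2\pi\nu$ and $k = 2\pi/\lambda$ ($\lambda$ being the wavelength in the
material); thus

$$v_g = \frac{d\nu}{d(1/\lambda)}$$        (8.30)

Now $\lambda = \lambda_0/n$ and $\nu = c/\lambda_0$, $\lambda_0$ being the vacuum wavelength and
$n$ being the refractive index, giving

$$\frac{d(1/\lambda)}{d\lambda_0} = \frac{d(n/\lambda_0)}{d\lambda_0} = -\frac{1}{\lambda_0^2}\left(n - \lambda_0\frac{dn}{d\lambda_0}\right), \qquad \frac{d\nu}{d\lambda_0} = -\frac{c}{\lambda_0^2}$$

so that

$$v_g = \frac{c}{n - \lambda_0\, dn/d\lambda_0}$$        (8.31)

A parameter that may be usefully introduced at this stage for later use is the material group
index $N_g$, where $N_g = c/v_g$; thus

$$N_g = n - \lambda_0\frac{dn}{d\lambda_0}$$        (8.32)

FIG. 8.26 Variation of the quantity $\lambda_0^2\, d^2n/d\lambda_0^2$ with wavelength for pure
silica (SiO$_2$). The function becomes zero at about 1.3 µm, which results in very small values
for the material dispersion near this wavelength. The addition of dopants displaces the curve
slightly.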



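Now the group transit time, $\tau_g$, for radiation of wavelength $\lambda_0$ to travel a
distance $L$ in the medium is given by

$$\tau_g = \frac{L}{v_g} = \frac{L N_g}{c}$$        (8.33)

If the radiation in fact has a finite wavelength spread $\Delta\lambda_0$, then the resulting
spread in transit times over the length $L$ will be given by $\Delta\tau^{\mathrm{mat}}$ where

$$\Delta\tau^{\mathrm{mat}} \approx \frac{d\tau_g}{d\lambda_0}\,\Delta\lambda_0$$

that is,

$$\Delta\tau^{\mathrm{mat}} = -\frac{L}{c}\left(\lambda_0^2\frac{d^2 n}{d\lambda_0^2}\right)\frac{\Delta\lambda_0}{\lambda_0}$$        (8.34)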
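The step from the group index to eq. (8.34) can be written out explicitly. The short derivation
below is a worked sketch that assumes only the definitions $N_g = n - \lambda_0\,dn/d\lambda_0$
and $\tau_g = L N_g/c$ used in this section:

```latex
\begin{align*}
\frac{dN_g}{d\lambda_0}
  &= \frac{dn}{d\lambda_0} - \frac{dn}{d\lambda_0}
     - \lambda_0 \frac{d^2 n}{d\lambda_0^2}
   = -\lambda_0 \frac{d^2 n}{d\lambda_0^2} \\[4pt]
\Delta\tau^{\mathrm{mat}}
  &\approx \frac{d\tau_g}{d\lambda_0}\,\Delta\lambda_0
   = \frac{L}{c}\,\frac{dN_g}{d\lambda_0}\,\Delta\lambda_0
   = -\frac{L}{c}\left(\lambda_0^2 \frac{d^2 n}{d\lambda_0^2}\right)
      \frac{\Delta\lambda_0}{\lambda_0}
\end{align*}
```

The first-derivative terms cancel, so the pulse spread depends only on the curvature of
$n(\lambda_0)$, which is why the dimensionless quantity $\lambda_0^2\,d^2n/d\lambda_0^2$ is the
natural one to plot.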



As far as pulse spreading is concerned we are usually only interested in the magnitude of
$\Delta\tau^{\mathrm{mat}}$, although the sign of $\Delta\tau^{\mathrm{mat}}$ in fact determines
whether the long or short wavelengths arrive first (this may sometimes be important, see e.g.
section 9.3.10 on solitons).

A graph of the dimensionless quantity $\lambda_0^2\, d^2n/d\lambda_0^2$ as a function of
wavelength for silica is shown in Fig. 8.26 and calculations of material dispersion based on
this are given in Example 8.10.



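EXAMPLE 8.10 Material dispersion for a laser and LED source

We use eq. (8.34) to estimate material dispersion effects over 1 km of silica-based fiber
when the following two sources are used:

(a) a GaAs LED operating at 850 nm and with a 50 nm linewidth; and
(b) a laser operating at 1550 nm and with a 3 nm linewidth.

From Fig. 8.26 we have at 850 nm $\lambda_0^2\, d^2n/d\lambda_0^2 = 2.14 \times 10^{-2}$, whilst
at 1550 nm $\lambda_0^2\, d^2n/d\lambda_0^2 = -1.02 \times 10^{-2}$.

Using eq. (8.34) and taking magnitudes we then have

(a) for the LED

$$|\Delta\tau^{\mathrm{mat}}| = \frac{10^{3}}{3 \times 10^{8}} \times 2.14 \times 10^{-2} \times \frac{50 \times 10^{-9}}{850 \times 10^{-9}} = 4.2 \times 10^{-9}\ \mathrm{s}$$

(b) for the laser

$$|\Delta\tau^{\mathrm{mat}}| = \frac{10^{3}}{3 \times 10^{8}} \times 1.02 \times 10^{-2} \times \frac{3 \times 10^{-9}}{1550 \times 10^{-9}} = 6.6 \times 10^{-11}\ \mathrm{s}$$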
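The two estimates in Example 8.10 can be checked with a few lines of Python. This is a sketch
under stated assumptions: the function name is hypothetical, and the values of
$\lambda_0^2\,d^2n/d\lambda_0^2$ are inputs that would normally be read from Fig. 8.26 (the
850 nm value of about $2.14 \times 10^{-2}$ used here is an assumption chosen to be consistent
with the quoted 4.2 ns result).

```python
C = 3.0e8  # speed of light in vacuum, m/s


def material_dispersion_spread(length_m, wavelength_m, linewidth_m, lam2_d2n, c=C):
    """Signed transit-time spread from eq. (8.34).

    lam2_d2n is the dimensionless quantity lambda0^2 d^2n/dlambda0^2.
    """
    return -(length_m / c) * lam2_d2n * (linewidth_m / wavelength_m)


if __name__ == "__main__":
    # (a) GaAs LED: 850 nm, 50 nm linewidth (curve value assumed)
    led = material_dispersion_spread(1.0e3, 850e-9, 50e-9, 2.14e-2)
    # (b) laser: 1550 nm, 3 nm linewidth
    laser = material_dispersion_spread(1.0e3, 1550e-9, 3e-9, -1.02e-2)
    print("LED   |spread| =", abs(led), "s")
    print("laser |spread| =", abs(laser), "s")
```

The opposite signs of the two results reflect the change of sign of the curvature of
$n(\lambda_0)$ on either side of about 1.3 µm.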

Inspection of Fig. 8.26 shows that the quantity $\lambda_0^2\, d^2n/d\lambda_0^2$ passes through
zero at about 1.3 µm. Unfortunately it is not the case that material dispersion vanishes
altogether when the source is centred on this wavelength [...]. Although the longest and
shortest wavelengths may be travelling with the same group velocity, the central wavelength will
be travelling with a slightly higher velocity. The situation is dealt with in Problem 8.[...];
it remains true, however, that material dispersion will become extremely small at, or close to,
this wavelength.

We must now deal with the two other types of dispersion found in single mode optical
fibers. Profile dispersion arises because of the variation of $\Delta$ with wavelength.
In other words, it depends on the fact that the variations of the refractive indices of core and
cladding with wavelength may not be the same. However, since the core
and cladding are derived from the same material, the dependence of $\Delta$ (as defined
by eq. 8.22) on wavelength is very small, and profile dispersion is usually small compared with
material dispersion. Waveguide dispersion, however, can be [...]. It arises because the mode
propagation velocity itself depends on wavelength regardless of any refractive index
variations of the medium. To appreciate how this comes about we refer back to the discussion
on propagation in planar dielectric waveguides (section 8.[...]). There we can see that
if the wavelength propagating in the guide is increased then the value of the expression
$(2\pi d n_1 \cos\theta)/\lambda_0$ will decrease and hence the point of intersection of the
curves $y = m\pi + \phi(\theta)$ and $y = (2\pi d n_1 \cos\theta)/\lambda_0$ will move [...].
This will change the effective velocity of the mode [...]. On a simple ray
model the effective velocity will decrease as [...].
For a step index single mode fiber waveguide dispersion becomes increasingly negative with
increasing wavelength. A diagram illustrating material dispersion and waveguide dispersion
together with the resulting total dispersion as a function of wavelength is
given in Fig. 8.27. In general the minimum dispersion wavelength for the total dispersion
occurs at a somewhat higher wavelength than that for which material dispersion is a
minimum. Since, in fact, the waveguide dispersion increases with decreasing values of the
parameter V, reducing the core diameter will give an increased waveguide dispersion, and
will thus shift $\lambda_{\min}$ to higher wavelengths. The actual value of $\lambda_{\min}$
depends on $a$, $\Delta$ and also the details of the refractive index profile. By a suitable
choice of these parameters it is possible to make $\lambda_{\min}$ lie anywhere between 1.3 µm
and 1.6 µm. Such fibers are referred to as dispersion shifted fibers. By using more complicated
fiber structures such as the so-called depressed cladding fiber (Fig. 8.28), it is possible also
to obtain dispersion flattened fibers where the dispersion curve crosses the zero axis at two
wavelengths (Fig. 8.29). This gives rise to a fiber with relatively low dispersion over an
appreciable wavelength range, which is useful since, as we shall see in section 8.4,
silica-based fibers exhibit minimum losses at a wavelength of 1.55 µm. Thus it is possible to
manufacture fibers which exhibit a minimum dispersion and loss at the same wavelength.

FIG. 8.27 Illustration of the magnitudes of material dispersion together with waveguide
dispersion as a function of wavelength; the resultant dispersion is also shown.

FIG. 8.28 Refractive index profile of the 'W' (or depressed cladding) fiber.

FIG. 8.29 The variation of total dispersion with wavelength in a dispersion-flattened fiber.



8.4 Losses in fibers

We may divide fiber losses into two categories: (a) those which result from the distortion
of the fiber from the ideal straight line configuration (extrinsic losses) and (b) those that
are inherent in the fiber itself (intrinsic losses).

8.4.1 Bending losses

In multimode fibers the principal bending loss mechanism arises from mode coupling as
discussed in section 8.3.2. From there we remember that bends in a multimode fiber can lead
to the possibility that energy in a particular mode is coupled into a different mode. If the
initial mode is one that is close to cut-off, then, after the coupling process, the energy might
find itself in a non-guided mode (i.e. with $\theta < \theta_c$) which is refracted out of the
core and into the cladding. It is evident that the energy which is in higher order modes is more
likely to be lost in this way than that in the lower order modes.

When a fiber is bent into an arc of a circle, the loss is found to be strongly dependent on
the radius of curvature, and a semiempirical expression that is sometimes used to describe
the loss is

$$\alpha_B = C \exp(-R/R_c)$$        (8.35)

where $\alpha_B$ is the absorption coefficient due to the bend, $R$ the bend radius and $R_c$ a
constant which depends on the fiber parameters. Quite considerable losses can be observed when
bend radii of the order of millimetres are involved, although such excessive deformations are
quite easily avoided in practice and in any case may very well cause fracture of the fiber.

Another cause of loss is a continuous succession of small deformations or microbends
which can occur when the fiber is pressed against a surface which is not perfectly smooth
(Fig. 8.30). A detailed analysis of the situation when a fiber is deformed into a small
amplitude sinusoid of wavelength $\Lambda$ (ref. 8.11) shows that modes with propagation
constants $\beta_1$ and $\beta_2$ will be strongly coupled together provided that

$$|\beta_1 - \beta_2| = \frac{2\pi}{\Lambda}$$        (8.36)

This expression shows that higher order modes are coupled together with relatively small values
of $\Lambda$, whilst adjacent lower order modes are coupled with larger $\Lambda$ values. Here
'small' means typically of the order of 1 mm, whilst 'large' means about 20 mm or more. Since it
is the higher order modes that are most susceptible to mode coupling losses, periodic
deformations with a periodicity of the order of a few millimetres are best avoided. Such
microbends can be caused by bare fiber being pressed against a protective jacket or wound
tightly on a drum. Unless some care is taken, microbending loss can add appreciably to the
intrinsic loss in a fiber (ref. 8.12).

In single mode fibers bending loss is also present but a different mechanism is responsible.
Figure 8.31 shows the field within a single mode fiber as it propagates round a bend.
It can be seen that the part of the mode which is on the outside of the bend will need to travel
faster than the part on the inside to maintain a wavefront that is perpendicular to the
propagation direction. Now each mode extends, in theory, an infinite distance into the cladding
despite the exponential decline of the electric field within it (see eq. 8.8). Consequently some
part of the mode in the cladding will find itself attempting to travel at greater than the speed
of light. Since, according to Einstein's theory of relativity, this is not possible, the energy
associated with this particular part of the mode must be radiated away. It is reasonable to
deduce that the loss will be greater (a) for bends with smaller radii of curvature and (b) when
the mode extends most into the cladding. In fact the loss can be represented by a function
similar to that of eq. (8.35) with $R_c$ being equal to $a/(\mathrm{NA})^2$ (ref. 8.13), where
$a$ is the fiber radius. Bend radii of the order of $R_c$ are likely to result in considerable
losses, although in fact the mechanical properties of the fiber usually cause the fiber to break
long before such small bend curvatures can be achieved.
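To get a feel for the size of $R_c$ in a single mode fiber, the expression
$R_c = a/(\mathrm{NA})^2$ can be evaluated directly. The sketch below is illustrative only: the
function names are hypothetical, the sample values (radius 4 µm, NA of 0.1) are assumptions, and
because the constant $C$ in eq. (8.35) is not specified only the ratio of losses at two bend
radii is computed, in which $C$ cancels.

```python
import math


def characteristic_bend_radius(radius_m, na):
    """R_c = a / NA^2, the characteristic radius quoted for single mode fibers."""
    return radius_m / na ** 2


def relative_bend_loss(bend_radius_m, reference_radius_m, r_c_m):
    """alpha_B(R) / alpha_B(R_ref) for alpha_B = C exp(-R / R_c); C cancels."""
    return math.exp(-(bend_radius_m - reference_radius_m) / r_c_m)


if __name__ == "__main__":
    r_c = characteristic_bend_radius(4.0e-6, 0.1)  # hypothetical fiber
    print("R_c =", r_c, "m")
    # Loss coefficient at a 5 mm bend relative to that at a 10 mm bend
    print("ratio =", relative_bend_loss(5.0e-3, 10.0e-3, r_c))
```

With these assumed values $R_c$ comes out well below a millimetre, consistent with the remark
that a fiber would normally break before being bent so tightly.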

8.4.2 Intrinsic fiber losses 

FIG. 8.30 Small kinks or 'microbends' can be formed when a fiber is pressed against a surface
that is not perfectly smooth.

FIG. 8.31 Illustration of the mechanism of radiation loss in fibers at bends. To maintain a
plane wavefront, part of the mode (shown shaded on the diagram) may have to travel at velocities
greater than that of light in the cladding. Since this is not possible, this portion of the mode
must be radiated away.

In this section we examine the losses in silica-based fiber that depend on the fiber material
itself (plastic fibers are discussed in section 8.7.3). Before discussing details of the loss
mechanisms it is useful to consider the unit commonly used to characterize loss in fibers. This
is dB km$^{-1}$ and is defined as follows: if a beam of optical power $P_i$ is launched into one
end of a fiber and if, after a length $L$ of fiber, the power remaining in the fiber is $P_f$,
then the loss (or attenuation) is given by

$$\text{attenuation} = \frac{10 \log_{10}(P_i/P_f)}{L}\ \mathrm{dB\ km^{-1}}$$        (8.37)
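The definition in eq. (8.37) is simple to apply in both directions, from measured powers to an
attenuation figure and from an attenuation figure to the power remaining after a given length.
The Python sketch below is a minimal illustration with hypothetical function names.

```python
import math


def attenuation_db_per_km(p_in, p_out, length_km):
    """Attenuation from eq. (8.37): 10 log10(P_i / P_f) / L, with L in km."""
    return 10.0 * math.log10(p_in / p_out) / length_km


def power_remaining(p_in, attenuation, length_km):
    """Invert eq. (8.37): power left after length_km at the given dB/km."""
    return p_in * 10.0 ** (-attenuation * length_km / 10.0)


if __name__ == "__main__":
    # Half the power lost over 1 km corresponds to about 3 dB/km
    print(attenuation_db_per_km(1.0, 0.5, 1.0))
    # A 0.2 dB/km fiber over 50 km loses 10 dB, i.e. a factor of ten in power
    print(power_remaining(1.0, 0.2, 50.0))
```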



Losses intrinsic to silica fibers have two main sources: (a) scattering losses and
(b) absorption losses.



SCATTERING LOSSES

We have assumed in our discussion of light propagation in fibers that the material is
homogeneous. Silica is an amorphous material and thus suffers from structural disorder: that is,
the same basic molecular units are present throughout the material but these are connected
together in an essentially random way. This results in a fluctuation in the refractive index
through the material with each irregularity acting as a point scattering centre. The scale of
the fluctuations is of the order of $\lambda_0/10$ or less, and the scattering is known as
Rayleigh scattering and characterized by an absorption coefficient that varies as
$\lambda_0^{-4}$ (ref. 8.14). In detail the absorption coefficient due to Rayleigh scattering
($\alpha_R$) can be written

$$\alpha_R = \frac{8\pi^3}{3\lambda_0^4}\, n^8 p^2 \beta k T_f$$        (8.38)

where $n$ is the refractive index, $p$ the photoelastic coefficient, $k$ Boltzmann's constant
and $\beta$ the isothermal compressibility at the temperature $T_f$, which itself is the
temperature at which the disorder becomes effectively 'frozen in' as the glass cools.

FIG. 8.32 Typical attenuation versus wavelength plot for a silica-based optical fiber. The
contribution from Rayleigh scattering is shown, as are the other two main loss mechanisms,
namely the infrared absorption tail and the hydroxyl (OH$^-$) absorption peaks.



EXAMPLE 8.11 Rayleigh scattering loss

We estimate the contribution that Rayleigh scattering makes to the attenuation in a
silica fiber at a wavelength of 1 µm. Taking the values $n = 1.45$, $p = 0.286$,
$\beta = 7 \times 10^{-11}$ m$^2$ N$^{-1}$ and $T_f = 1400$ K, we obtain from eq. (8.38)

$$\alpha_R = \frac{8\pi^3}{3(10^{-6})^4} \times (1.45)^8 \times (0.286)^2 \times 7 \times 10^{-11} \times 1.38 \times 10^{-23} \times 1400 = 1.79 \times 10^{-4}\ \mathrm{m^{-1}}$$

Over a length $L$ of fiber, an absorption coefficient of $\alpha_R$ will give rise to a loss of
$-10 \log_{10}[\exp(-\alpha_R L)]$ dB. Over a kilometre of fiber, we should then expect a loss
of $-10 \log_{10}[\exp(-1.79 \times 10^{-4} \times 10^{3})]$ dB km$^{-1}$ or 0.78 dB km$^{-1}$.

Lower attenuations will be obtained at longer wavelengths. At 1.55 µm, for example
(where fiber attenuation is at a minimum), we would expect the absorption to be a factor
$(1.55)^4$ or 5.8 smaller, that is $\alpha_R = 3.1 \times 10^{-5}$ m$^{-1}$, which is equivalent
to 0.13 dB km$^{-1}$.
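The Rayleigh estimate of Example 8.11 can be reproduced numerically. The sketch below is
illustrative: the function names are hypothetical, and eq. (8.38) is assumed to have the
standard form $\alpha_R = (8\pi^3/3\lambda_0^4)\,n^8 p^2 \beta k T_f$, which reproduces the
figures quoted in the example.

```python
import math

K_B = 1.38e-23  # Boltzmann's constant, J/K (rounded)


def rayleigh_coefficient(wavelength_m, n, p, beta, t_f, k=K_B):
    """Rayleigh scattering coefficient in m^-1, assumed form of eq. (8.38)."""
    return (8.0 * math.pi ** 3 / (3.0 * wavelength_m ** 4)
            * n ** 8 * p ** 2 * beta * k * t_f)


def loss_db(alpha_per_m, length_m):
    """Loss in dB over length_m for coefficient alpha: -10 log10[exp(-alpha L)]."""
    return -10.0 * math.log10(math.exp(-alpha_per_m * length_m))


if __name__ == "__main__":
    for lam in (1.0e-6, 1.55e-6):
        alpha = rayleigh_coefficient(lam, 1.45, 0.286, 7.0e-11, 1400.0)
        print(lam, "m:", alpha, "m^-1,", loss_db(alpha, 1.0e3), "dB/km")
```

The $\lambda_0^{-4}$ dependence is what makes the 1.55 µm window so much more favourable than
shorter wavelengths as far as scattering is concerned.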

ABSORPTION LOSSES

Absorption losses in the visible and near-infrared regions arise mainly from the presence
of impurities, particularly traces of transition metal ions (e.g. Fe$^{3+}$, Cu$^{2+}$) or
hydroxyl ions (OH$^-$). The latter are responsible for absorption peaks at 0.95 µm, 1.24 µm and
1.39 µm. Most of the dramatic successes in reducing fiber losses came about because of better
control of impurity concentrations.

At wavelengths greater than about 1.6 µm, the main losses are due to transitions between
vibrational states of the lattice itself. Although the actual fundamental absorption peaks occur
at wavelengths well into the infrared (in SiO$_2$, for example, the main peak is at 9 µm), the
anharmonic nature of the interatomic forces gives rise to combination and overtone bands:
that is, an incoming photon can simultaneously excite two or more fundamental lattice
vibrations (or phonons). Thus, a number of strong absorption bands extend all the way down to
about 3 µm with appreciable absorption still occurring below 2 µm. To date, some of the
lowest attenuations have been obtained with GeO$_2$-doped SiO$_2$ fibers (ref. 8.15), which have
shown minimum attenuations of about 0.2 dB km$^{-1}$ at 1.55 µm. This figure is quite close to
the limit set by Rayleigh scattering (see Example 8.11). A typical attenuation curve is shown
in Fig. 8.32.

8.5 Optical fiber connectors

8.5.1 Single fiber jointing

It is obviously impractical to assume that the correct lengths of fiber will always be
available for every situation and hence we need to be able to join fibers together in a manner
which gives rise to minimum light loss. As well as permanent connections there may also be
occasions when demountable connections are required.

Perhaps the simplest technique for permanently joining fibers together is to use a fusion
splice. Here (Fig. 8.33) the fiber ends are held in close proximity and heated up to the
material softening point (about 2000°C for silica). The fibers are then pushed together and the
two ends fuse to form a continuous length of fiber. Any slight initial misalignment of the
two ends tends to be corrected as surface tension forces pull the fibers back into alignment.
The finished join will be mechanically weak since the protective coating layer has to be
removed before the process starts. To provide protection it is usual to slide a small metal
ferrule over the join and seal it onto the fiber using epoxy resin. Both multimode and single
mode fibers can be joined by this means and transmission losses can be less than 0.1 dB.

FIG. 8.33 In fusion splicing the fiber ends are pushed together whilst being heated within an
electric arc.

Many other types of jointing, both permanent and demountable, involve a butt join whereby
the two fiber ends are held in close proximity. There are two types of loss associated with
this: Fresnel losses, caused by back reflection at the silica/air interfaces, and losses
associated with any misalignment of the end faces. The Fresnel losses are readily estimated: if
we assume light hits the fiber end at normal incidence then the fraction of light, $R_F$,
reflected back at each fiber end is given by

$$R_F = \left(\frac{n_1 - n_0}{n_1 + n_0}\right)^2$$        (8.39)

where $n_0$ is the refractive index of the medium between the fibers and $n_1$ that of the fiber
core. The total transmission for each face due to Fresnel reflection is then given by

$$T_F = 1 - R_F$$        (8.39a)

Since we have two interfaces the total transmission will then be $(T_F)^2$. The corresponding
loss usually amounts to a few tenths of a decibel (Example 8.12).



EXAMPLE 8.12 Fresnel losses at fiber joins

If we take a fiber core with refractive index 1.48 and assume that the medium between
the fibers is air ($n_0 = 1$), then $R_F$ is given by

$$R_F = \left(\frac{1.48 - 1}{1.48 + 1}\right)^2 = 0.0375$$

Thus the total transmission through the two faces is $(T_F)^2 = (1 - 0.0375)^2$ or 0.926. This
corresponds to a loss of $-10 \log_{10}(0.926)$ or 0.33 dB.
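The Fresnel estimate above generalizes easily to other gap media, for example an index matching
fluid. The following Python sketch uses hypothetical function names and assumes normal
incidence and two identical fiber/gap interfaces, as in the example.

```python
import math


def fresnel_reflectance(n1, n0=1.0):
    """Fraction reflected at one interface at normal incidence, eq. (8.39)."""
    return ((n1 - n0) / (n1 + n0)) ** 2


def butt_join_fresnel_loss_db(n1, n0=1.0):
    """Loss in dB from two interfaces, each transmitting T_F = 1 - R_F."""
    t_total = (1.0 - fresnel_reflectance(n1, n0)) ** 2
    return -10.0 * math.log10(t_total)


if __name__ == "__main__":
    print("R_F (air gap)      =", fresnel_reflectance(1.48))
    print("loss (air gap)     =", butt_join_fresnel_loss_db(1.48), "dB")
    # A gap medium with index 1.46 (an assumed value) nearly removes the loss
    print("loss (n0 = 1.46)   =", butt_join_fresnel_loss_db(1.48, 1.46), "dB")
```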



The second type of loss associated with butt-type connections arises from misalignment
between the fibers. There are three basic types of misalignment to be considered: (a) the
distance between the fibers along the fiber axis (longitudinal); (b) the offset distance
perpendicular to the fiber axis (lateral); and (c) the angle between the two axes of the fibers
(angular). These are illustrated in Fig. 8.34. For both multimode and single mode fibers the
most important of these is usually lateral loss. If we assume that step index multimode fibers
have a uniform irradiance across the fiber ends then it is fairly easy from a simple
determination of the end face overlap area to show that the transmission resulting from lateral
misalignment can be written (see Problem 8.17)

$$T_{\mathrm{lat}} = \frac{2}{\pi}\left\{\cos^{-1}\left(\frac{D}{2a}\right) - \frac{D}{2a}\left[1 - \left(\frac{D}{2a}\right)^2\right]^{1/2}\right\}$$        (8.40)

where $D$ is the lateral displacement and $a$ the fiber core radius. Figure 8.35 shows typical
results for lateral and longitudinal misalignments in step index multimode fiber.

The corresponding results for single mode fibers (and indeed for graded index fibers) tend
to be rather more difficult to obtain; for a summary the interested reader is referred to
ref. 8.16.

FIG. 8.34 Illustration of the three types of misalignment mentioned in the text:
(a) longitudinal, (b) lateral and (c) angular misalignments. The parameters used to describe
these ($S$, $D$ and $\alpha$) are also shown.



EXAMPLE 8.13 Transmission loss from lateral misalignment

We consider a multimode step index fiber where we have a lateral misalignment which
amounts to some 10% of the fiber diameter. That is, $D/2a = 0.1$. From eq. (8.40) we have

$$T_{\mathrm{lat}} = \frac{2}{\pi}\left\{\cos^{-1}(0.1) - 0.1\left[1 - (0.1)^2\right]^{1/2}\right\} = 0.873$$

The transmission loss is therefore $-10 \log_{10}(0.873) = 0.59$ dB.

This calculation excludes any Fresnel losses (Example 8.12); if these are included
the total loss becomes 0.59 + 0.33 = 0.92 dB.
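The overlap-area result of eq. (8.40) can be evaluated for any offset. The Python sketch below
is a minimal illustration with hypothetical function names; it assumes, as the text does,
uniform irradiance across identical step index cores and ignores Fresnel reflection.

```python
import math


def lateral_transmission(offset, core_radius):
    """Transmission for lateral offset D between cores of radius a, eq. (8.40)."""
    x = offset / (2.0 * core_radius)
    if not 0.0 <= x <= 1.0:
        raise ValueError("offset must lie between 0 and the core diameter")
    return (2.0 / math.pi) * (math.acos(x) - x * math.sqrt(1.0 - x * x))


def lateral_loss_db(offset, core_radius):
    """Corresponding loss in dB."""
    return -10.0 * math.log10(lateral_transmission(offset, core_radius))


if __name__ == "__main__":
    a = 25.0e-6  # hypothetical core radius
    for fraction in (0.05, 0.1, 0.2, 0.5):
        d = fraction * 2.0 * a  # offset as a fraction of the core diameter
        print(fraction, lateral_transmission(d, a), lateral_loss_db(d, a), "dB")
```

Because the formula depends only on the ratio $D/2a$, the assumed core radius affects the
tolerable offset in micrometres but not the loss for a given fractional misalignment.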



As far as making permanent joins which rely on butting fibers together is concerned, there
are three main steps. First the two end faces of the fibers are made as flat and as
perpendicular to the fiber axis as possible. Secondly the fibers are held in some sort of jig
which allows them to be closely aligned. An example of this is the V-groove arrangement of Fig.
8.36; another possibility is to put the fibers inside a closely fitting capillary tube. The
final step is to fix the fibers permanently in place, usually by the use of an epoxy-resin-type
glue.

When it comes to butt joining demountable connectors, there is a wide variety of different
types available from various manufacturers. One example is the biconical type illustrated
in Fig. 8.37; other types are described in ref. 8.17. With all types of butt joins it is
possible to reduce the contribution from Fresnel loss by filling the gaps between the fiber ends
with a fluid or grease which has a refractive index as close as possible to that of the fiber
core. However, in practical jointing situations the problem of dirt being picked up in the fluid
often outweighs the potential advantages.



FIG. 8.35 Effects of separation and offset misalignments on fiber connection losses (in dB) in
multimode fibers. The misalignments are expressed in terms of the normalized parameters $S/a$
(longitudinal) and $D/a$ (lateral), where $a$ is the fiber core radius. The residual loss when
the misalignment is reduced to zero is due to Fresnel losses.

FIG. 8.36 Illustration of the use of a V-groove for the alignment of fiber ends prior to
jointing.

FIG. 8.37 Schematic diagram of a biconical demountable connector for butt joining optical
fibers.

Typical values for loss in multimode connectors can range from a few decibels down to
a few tenths of a decibel depending on the degree of precision in the alignment (and usually,
in consequence, the cost). Losses in single mode connectors are generally somewhat higher
than in their multimode counterparts, not surprisingly in view of their much smaller core
diameters.

A type of demountable connector that does not rely on a butt join is shown in Fig. 8.38.
This uses a beam expander in which each half of the connector contains a lens with the fiber
end at the focal point. Light emerging from one end of a fiber becomes a relatively broad,
collimated beam in the region between the lenses and is then focused back onto the other
fiber end. Since the expanded beam width can be considerably larger than the fiber diameter,
losses due to offset misalignments between the two halves of the connector are now
considerably reduced. Great care is needed, however, in the initial alignment of the fiber with
respect to the lens in each half of the connector. An advantage of this type of connector is
that further optical components (e.g. beam splitters) can be inserted into the collimated beam
between the lenses.

FIG. 8.38 Fiber coupling using a beam expander. The fiber ends are situated at the focal points
of the two lenses resulting in an expanded collimated beam between the lenses.



We turn now to the problem of obtaining the requisite smooth, flat ends to the fiber. One 
of the simplest ways involves cleaving: the fiber is bent under tension over a curved surface 
whilst a knife edge 'nicks' the fiber (Fig. 8.39). This initiates crack formation in the fiber 
which, because of the curvature and tension in the fiber, rapidly spreads across the fiber in 
a direction perpendicular to the axis of the fiber. If carefully done end faces which are within 
a degree or so of the perpendicular to the fiber axis can be obtained. A more traditional tech- 
nique is to polish the ends mechanically whilst they are held in a suitable jig. Again good 
accuracy can be obtained, but the process can be quite time consuming. Before either of these 
techniques is carried out the outer plastic coating on the fiber must be removed, either by 
using some type of mechanical stripper or by chemical means. 



8.5.2 Fiber couplers 

A problem arising in some applications of optical fibers is how to share a signal from one 
or more inputs amongst several output fibers. Often there are as many input fibers as there
are output fibers. In this case the device is usually symmetrical and may be operated in
'reverse', with the outputs becoming the inputs and vice versa. A schematic illustration of a
four-port device is shown in Fig. 8.40. Here, an input to either of fibers 1 or 2 will result
in an output from fibers 3 and 4. The way in which the power from the input fiber is distributed
amongst the output fibers is governed by the insertion loss $L_{ins}$, where $L_{ins}$ is defined
by

$L_{ins} = -10\log_{10}(P_j/P_i)$ (8.41)

Here $P_i$ is the power flowing in the ith input fiber and $P_j$ is the power flowing in the jth
output fiber. $P_j$ will always be less than $P_i$ for two reasons: first because of the splitting of
the input power amongst the output fibers and secondly because there will inevitably be some
loss of optical power within the device itself. The total power loss is described by the so-called
excess loss ($L_e$) where

$L_e = -10\log_{10}\left(\sum_j P_j / P_i\right)$ (8.42)

Another term used is the split ratio. This describes the nominal insertion loss from one
input fiber to each of the output fibers (as in eq. 8.41) assuming there is no power loss in
the device. It thus represents an ideal situation rather than the actual performance of the
device.

FIG. 8.39 Illustration of one technique used to prepare plane fiber ends normal to the fiber axis. The fiber
is held in tension over a curved surface. Then the surface is scored with a tungsten carbide or diamond
knife edge. This initiates crack formation and, provided the tension and radius of curvature are set
correctly, a smooth break is obtained.

FIG. 8.40 Schematic representation of a four-port (2 x 2) fiber coupler. Power flowing in either fiber 1 or
2 will be distributed, in some predetermined way, into fibers 3 and 4.



A simple way of implementing such a coupler for multimode fibers is to butt the input
fibers up against the end of a 'mixing rod'. This is essentially a short length of large diameter
optical fiber. The output fibers are similarly butted up against the other end of the rod
(Fig. 8.41a). The rod must be long enough to ensure that the output from each input fiber
becomes sufficiently diffused within the rod to give an even spread of energy over the
exit face. To avoid excessive loss, it is best if the fibers can be 'close packed' over the
end face of the rod, a situation that can only be achieved for certain numbers (e.g. 7, 19, etc.)
of input fibers (see Fig. 8.41b). A calculation of the losses resulting from this type of coupler
is given in Example 8.14.

FIG. 8.41 Design for a 14 port (7 x 7) fiber coupler using a 'mixing rod'. The overall scheme is shown in
(a), whilst (b) shows how seven fibers can form a 'close-packed' array across the face of the mixing rod.

EXAMPLE 8.14 Fiber coupler losses

We consider the rod-type coupler illustrated in Fig. 8.41(a) with seven input and output
fibers, each having core and core + cladding diameters of $d_1$ and $d_2$ respectively. If the
mixing rod has a diameter of $3d_2$, the close-packed arrangement of Fig. 8.41(b) can
be obtained. Losses will inevitably result because the cores of the output fibers have
a combined area that is less than that of the end of the mixing rod. Assuming a uniform
spread of light from any one input fiber over the end of the coupling rod and no losses
in the coupling rod, we may write $P_i = B\pi(3d_2)^2/4$ whilst $P_j = B\pi d_1^2/4$, where B is a
constant. Thus from eq. (8.41) the insertion loss is given by

$L_{ins} = -10\log_{10}\left(\dfrac{B\pi d_1^2/4}{B\pi(3d_2)^2/4}\right)$ or $-10\log_{10}\left(\dfrac{d_1^2}{9d_2^2}\right)$

Taking $d_1$ = 200 µm and $d_2$ = 250 µm gives $L_{ins}$ = 11.5 dB.
Similarly the excess loss is readily seen to be given by

$L_e = -10\log_{10}\left(\dfrac{7d_1^2}{9d_2^2}\right)$

whence $L_e$ = 3 dB.
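The arithmetic of Example 8.14 can be checked with a short sketch. The function names and the default arguments (seven fibers, a rod three fiber diameters across) are our own illustrative choices; the only physics assumed is the uniform illumination of the rod end face stated in the example.

```python
import math

def loss_db(p_out, p_in):
    """Loss in decibels for a power ratio p_out / p_in (form of eq. 8.41)."""
    return -10.0 * math.log10(p_out / p_in)

def mixing_rod_losses(d_core, d_fiber, n_fibers=7, rod_in_fiber_diameters=3):
    """Return (insertion loss, excess loss) in dB for a mixing-rod coupler.

    Assumes light is spread uniformly over the rod end face, so collected
    power is proportional to collecting area (the constant B cancels).
    """
    rod_area = math.pi * (rod_in_fiber_diameters * d_fiber) ** 2 / 4
    core_area = math.pi * d_core ** 2 / 4
    l_ins = loss_db(core_area, rod_area)            # one output fiber
    l_e = loss_db(n_fibers * core_area, rod_area)   # all output fibers together
    return l_ins, l_e

l_ins, l_e = mixing_rod_losses(d_core=200e-6, d_fiber=250e-6)
print(f"insertion loss = {l_ins:.1f} dB, excess loss = {l_e:.1f} dB")
```

With the diameters of the example this gives about 11.5 dB and 3.0 dB; the difference between the two, $10\log_{10}7 \approx 8.5$ dB, is the ideal seven-way splitting loss.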
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Another technique used to make multimode fiber couplers is to twist the fibers around 
each other and then to heat them under tension in an oxypropane torch flame. The bundle 
fuses together and then elongates to form a device known as a fused biconical taper coupler.
The coupling mechanism is believed to be as follows: when light from one fiber enters the
fused region, the narrowing core causes mode conversion into cladding modes. These subsequently
spread out across the whole of the cross-section in the fused region. Then, when
the fibers separate, the cladding modes are converted back into core modes, but with the
energy now distributed amongst all the fibers. Such couplers exhibit low coupling losses
but the coupling properties between the various ports may vary.

Single mode fiber couplers can also be made by the same technique, though the coupling 
mechanism is thought to be somewhat different. When the radiation encounters the narrowing
fiber core, the effective V parameter of the fiber will decrease (since $V \propto d/\lambda_0$) and
the mode field will extend further out into the cladding (e.g. see Fig. 8.25) so that it overlaps
with the other fiber cores, and hence causes energy to be coupled into them. This is
illustrated in Fig. 8.42. 

By using the so-called coupled mode theory (ref. 8.28) it may be shown that the fraction,
$F_c$, of the energy initially in one of the fibers which is coupled into the other can be written

$F_c = \sin^2(Cz)$ (8.43)

where z is the distance over which the fibers are coupled and C represents the strength of
the coupling between the fibers. The latter should depend on the spread of the evanescent
field which in turn depends on the V parameter: the smaller the value of V the greater the
spread. Since V is inversely proportional to the wavelength we may conclude that the
longer the wavelength the stronger the coupling. Thus C will be some increasing function
of wavelength, $C(\lambda)$, which we assume we can write as $b\lambda^q$, where b and q are
constants, so that

$F_c = \sin^2(b\lambda^q z)$

FIG. 8.42 Coupling mechanism in a single mode biconical taper coupler involving evanescent mode overlap.
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The coupler will exhibit a quasi-oscillatory behaviour as a function of wavelength, thus 
emphasizing that an intended coupling ratio will only be obtained at a particular operating 
wavelength. 
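The wavelength sensitivity can be illustrated numerically. The sketch below assumes the power-law form $C = b\lambda^q$ introduced above; the values of q, z and the design wavelength are purely hypothetical, and b is chosen so that the coupler splits equally at the design wavelength.

```python
import math

def coupled_fraction(wavelength_um, z_mm, b, q):
    """Fraction of energy coupled across, sin^2(b * lambda^q * z)."""
    return math.sin(b * wavelength_um ** q * z_mm) ** 2

# Hypothetical coupler: equal (3 dB) splitting designed for 1.30 um.
design_wavelength_um = 1.30
q = 2.5          # assumed exponent, for illustration only
z_mm = 10.0      # assumed coupling length
b = (math.pi / 4) / (design_wavelength_um ** q * z_mm)  # makes C*z = pi/4

for wl in (1.30, 1.55):
    print(f"{wl:.2f} um: coupled fraction = {coupled_fraction(wl, z_mm, b, q):.2f}")
```

For these assumed parameters the coupled fraction rises from 0.50 at 1.30 µm to nearly 0.9 at 1.55 µm, so a coupler made for one wavelength is far from an equal splitter at the other.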

One of the most useful single mode couplers is a four-port device where a given input
signal is split equally between two output fibers. Assuming no losses in the coupler, the inser- 
tion loss is 0.5 or 3 dB. Such couplers are referred to as 3 dB couplers. 



8.6 Measurement of fiber characteristics



8.6.1 Introduction 

In this section, we describe techniques used to measure those properties such as loss and 
bandwidth which are important in optical fiber communications. Measurements of purely 
mechanical properties are not included here for reasons of space (ref. 8.18 may be consulted 
for a fuller discussion of all types of measurement). Before addressing ourselves to specific
techniques, however, it is useful to understand the importance of obtaining the correct launch 
conditions. 

Suppose we illuminate the whole of a fiber end with a uniform, highly convergent beam 
of light. All modes in the fiber will then be excited, including leaky and cladding modes. If 
we were to examine the mode structure a considerable distance along the fiber, almost all 
of the lossy modes would have died out. In addition, mode coupling would have thoroughly
'scrambled' the remaining modes to generate an 'equilibrium' mode distribution that would
then remain more or less the same throughout the rest of the length of the fiber. In principle,
such a distribution would be obtained whatever the launch conditions, provided we make
our observation of the mode distribution sufficiently far along the fiber. In practical terms,
however, this might amount to several kilometres. Most of the properties we are concerned
with here do indeed depend on the mode distribution, so that it makes sense to try and measure
these for the equilibrium distribution. On the other hand, it may be very inconvenient if many
kilometres of fiber have to be used to obtain this distribution. It would obviously be preferable
if the launch conditions were such as to set up equilibrium right from the start.
Unfortunately this is not at all easy to do, although several procedures can be adopted that
go some way towards realizing this goal. For example, to avoid launching cladding and leaky
modes care must be taken to illuminate only the core area with radiation whose angle of incidence
is less than the acceptance angle of the fiber (as given by eq. 8.21).

A device that aims to produce an equilibrium 'mix' of modes over a short length of fiber 
is the mode scrambler. In this, a large amount of mode coupling is deliberately introduced
by means of bending or microbending distortion (see section 8.4.1). For example, a few turns
of the fiber may be wrapped round a post or mandrel a few tens of millimetres in diameter 
(Fig. 8.43a). Alternatively, the fiber may be passed through a series of posts to produce a 
serpentine distortion (Fig. 8.43b). A simple variant of the latter technique is to sandwich the 
fiber between two layers of coarse sandpaper over a length of some tens of millimetres. 

The use of a mode scrambler may itself cause cladding modes to be excited, and these
will have to be removed subsequently. One device designed to remove cladding modes is
called a mode stripper, which may be simply constructed by taking a short length of fiber
that has been stripped of its outer protective coating and pressing it against a felt pad that
has been saturated in oil (such as paraffin oil) which has a slightly higher refractive index
than the cladding. Any light incident on the cladding/oil interface will then be refracted out
of the fiber and absorbed within the felt pad.

FIG. 8.43 Techniques for producing an equilibrium mode distribution. In (a), the 'mandrel wrap' method, a
few turns of fiber are taken around a rod. In (b), the fiber makes a serpentine path through a series of posts.



8.6.2 Fiber attenuation measurements

At its simplest, the measurement of attenuation involves focusing a known amount of monochromatic
radiation onto the end of a known length of fiber and then measuring the amount
emerging from the other end (Fig. 8.44). Use of eq. (8.37) then enables the fiber attenuation
to be determined. Radiation at different wavelengths can be obtained by using a white
light source, such as a quartz halogen lamp, in conjunction with either a monochromator or
a set of narrow bandpass optical filters. This method ignores the inevitable losses incurred
when coupling light into the fiber, although for fibers with high attenuation, this may prove
acceptable. With low loss fibers, however, the cutback technique is generally used. In this,
light is launched as in the simple technique and the emerging power measured. Then, without
changing the launch conditions in any way, the fiber is cut back to within a few metres of
the launch end, and the emerging energy again measured. The fractional transmittance of
the length of fiber removed is then given by the ratio of these two measurements, since the
launch energy is the same in both cases. As outlined in the introduction to this section, care
must be taken with the launch conditions to obtain an equilibrium mode distribution (i.e.
by using a mode scrambler/mode stripper combination).
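The reduction of cutback data is simple enough to sketch. The function name and the sample readings below are hypothetical, and we assume the usual decibel definition of attenuation per unit length.

```python
import math

def cutback_attenuation_db_per_km(power_full_length, power_after_cutback,
                                  length_removed_km):
    """Attenuation of the removed fiber section in dB per km.

    power_full_length   : power emerging from the full length of fiber
    power_after_cutback : power emerging from the short stub left after cutting
    Both readings must share the same units and the same launch conditions.
    """
    transmittance = power_full_length / power_after_cutback
    return -10.0 * math.log10(transmittance) / length_removed_km

# Hypothetical readings: 1.0 uW through the full fiber, 2.0 uW after
# cutting back 1.0 km of it.
print(f"{cutback_attenuation_db_per_km(1.0, 2.0, 1.0):.2f} dB/km")
```

Because only the ratio of the two readings enters, the unknown coupling loss at the launch end cancels, which is the point of the technique.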
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FIG. 8.44 Schematic layout of apparatus for measuring attenuation in optical fibers. To enhance the 

signal-to-noise ratio, an optical chopper-lock-in amplifier combination is often used. 

The above techniques measure the total attenuation within the fiber, which, from a prac- 
tical point of view, is usually all that is required. However, it may be that the individual con- 
tributions from absorption and scattering need to be assessed separately. Absorption losses 
may be determined by measuring the temperature rise within a given length of fiber, whilst 
scattering loss measurements depend on determining the amount of light scattered out of 
the fiber (see Problem 8.18 and also ref. 8.19). 



8.6.3 Fiber dispersion measurements 

When a narrow optical pulse is launched into a fiber, the various dispersion mechanisms 
discussed in sections 8.3.2 to 8.3.5 will cause the pulse to broaden as it travels down the 
fiber. In the so-called time domain measurement of dispersion, this broadening is measured
directly. A narrow optical pulse may conveniently be generated by using a semiconductor 
laser in conjunction with fast driving electronics. Pulses with widths of a few nanoseconds 
or less may be readily obtained. If the main source of dispersion is intermodal dispersion 
then, as in attenuation measurements, an equilibrium mode distribution will have to be 
created. The detector used must obviously have a fast response and often avalanche photo- 
diodes are used. Since the pulses may be too narrow to be displayed easily on an ordinary 
oscilloscope, a fast sampling oscilloscope may be required. 

Ideally, the input pulse should be narrow compared with the output pulse so that it can 
be thought of as a 'spike' of negligible width. The shape of the output pulse is then called 
the transfer function (see Appendix 5). Normally, however, the input pulse has appreciable 
width and determining the transfer function becomes more difficult. A simplification 
often adopted is to assume that the input pulse, transfer function and output pulses are all 
Gaussian in shape (see Fig. A5.1b); that is, we can describe each curve by the function
$A(t) = A(0)\exp(-t^2/2\sigma^2)$, where time is measured from the peak of the curve. The parameter
$\sigma$ essentially determines the r.m.s. pulse width. If $\sigma_1$, $\sigma_t$ and $\sigma_2$ are the values for the initial,
transfer and final pulse widths, then it can be shown (ref. 8.20) that

$\sigma_2^2 = \sigma_1^2 + \sigma_t^2$ (8.44)

that is,

$\sigma_t = (\sigma_2^2 - \sigma_1^2)^{1/2}$ (8.44a)
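As a small illustration of the Gaussian approximation, the width attributable to the fiber alone can be recovered by subtracting the input and output widths in quadrature. The function name and the sample widths are our own.

```python
import math

def transfer_width(sigma_initial, sigma_final):
    """Width of the fiber transfer function, assuming all three curves are
    Gaussian so that the widths add in quadrature."""
    if sigma_final < sigma_initial:
        raise ValueError("output pulse cannot be narrower than input pulse")
    return math.sqrt(sigma_final ** 2 - sigma_initial ** 2)

# Hypothetical measurement: 0.3 ns input pulse, 0.5 ns output pulse.
print(f"transfer width = {transfer_width(0.3, 0.5):.2f} ns")
```

Note that the correction matters most when the input pulse is not much narrower than the output pulse; here ignoring it would overestimate the fiber's contribution by 25%.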

An alternative method of measuring dispersion is to take measurements in the frequency
domain. An emitting device (a laser or LED, for example) is amplitude modulated with a variable
frequency sinusoidal waveform, and the output from the fiber is recorded as a function
of frequency. This allows the frequency bandwidth of the fiber to be measured directly. In
fact two bandwidths are commonly used: the optical bandwidth $\Delta f_{opt}$ and the electrical bandwidth
$\Delta f_{el}$. The former is defined as the frequency at which the signal falls to half its maximum
low frequency value, the latter as the frequency when the signal falls to $1/\sqrt{2}$ of its maximum
value. Appendix 5 should be consulted for a discussion of the relationships between pulse
dispersion and bandwidths. For a Gaussian pulse, the electrical bandwidth and pulse width
are related by
1 

A/el = 
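For a Gaussian pulse the two bandwidths follow directly from the definitions above. The sketch below is our own working rather than a quotation of the relation in the text: it assumes a pulse $\exp(-t^2/2\sigma^2)$, whose frequency response relative to its low frequency value is $\exp(-2\pi^2 f^2\sigma^2)$, and solves for the frequencies at which this falls to 1/2 and to $1/\sqrt{2}$.

```python
import math

def gaussian_response(f, sigma):
    """Relative frequency response of a Gaussian pulse of r.m.s. width sigma."""
    return math.exp(-2.0 * (math.pi * f * sigma) ** 2)

def gaussian_bandwidths(sigma):
    """Return (optical, electrical) bandwidths in Hz for sigma in seconds."""
    optical = math.sqrt(math.log(2) / 2) / (math.pi * sigma)      # response = 1/2
    electrical = math.sqrt(math.log(2)) / (2 * math.pi * sigma)   # response = 1/sqrt(2)
    return optical, electrical

f_opt, f_el = gaussian_bandwidths(1e-9)  # hypothetical 1 ns r.m.s. width
print(f"optical: {f_opt/1e6:.0f} MHz, electrical: {f_el/1e6:.0f} MHz")
```

Under these assumptions the electrical bandwidth is roughly $0.13/\sigma$ and the optical bandwidth is larger by a factor of $\sqrt{2}$.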



Although the two techniques above enable the total dispersion exhibited by a fiber to be determined,
material dispersion is such an important pulse broadening mechanism
that it deserves a more detailed investigation. The most usual technique is to measure
the time taken for a pulse to traverse a known length of fiber as a function of the wavelength
of the source generating the pulse (this assumes we have a variable wavelength source, or
a series of sources of differing wavelengths). This then determines how the group transit
time ($\tau_g$) depends on $\lambda_0$. The gradient of this relationship then gives the quantity $d\tau_g/d\lambda_0$ as
a function of $\lambda_0$. This, of course, is the quantity which is required in eq. (8.34) to determine
pulse spreading due to material + waveguide dispersion.
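In practice the gradient has to be estimated from transit times measured at a handful of wavelengths. One simple way, sketched here with hypothetical data and our own function name, is to use central differences at interior points and one-sided differences at the two ends.

```python
def delay_gradient(wavelengths_nm, delays_ps):
    """Estimate d(tau_g)/d(lambda_0) in ps/nm at each measured wavelength.

    Central differences are used at interior points and one-sided
    differences at the ends. The spacing need not be uniform.
    """
    n = len(wavelengths_nm)
    if n < 2 or n != len(delays_ps):
        raise ValueError("need at least two matched (wavelength, delay) pairs")
    gradients = []
    for i in range(n):
        lo, hi = max(i - 1, 0), min(i + 1, n - 1)
        gradients.append((delays_ps[hi] - delays_ps[lo])
                         / (wavelengths_nm[hi] - wavelengths_nm[lo]))
    return gradients

# Hypothetical relative transit times with a minimum near 1300 nm.
wavelengths = [1280.0, 1290.0, 1300.0, 1310.0, 1320.0]
delays = [20.0, 5.0, 0.0, 5.0, 20.0]
print(delay_gradient(wavelengths, delays))
```

The gradient passes through zero where the transit time is a minimum, which is the wavelength of zero total dispersion for the hypothetical fiber in this data set.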



8.6.4 Cut-off wavelengths in single mode fiber

The cut-off wavelength, $\lambda_c$, is the wavelength in a fiber below which it ceases to behave as
a single mode fiber. One of the simplest (but not necessarily the most accurate) techniques
to measure $\lambda_c$ is to launch light of variable wavelength obtained from, say, a broad band radiation
source and a monochromator into the fiber, and measure the transmission of the fiber
as a function of wavelength. With two modes travelling in the fiber the amount of radiation
that can be transported along the fiber is roughly doubled compared with when only a single
mode can propagate. Thus the transmission curve shows a sudden increase below the cut-off
wavelength $\lambda_c$, as shown, for example, in Fig. 8.45. Allowance should be made for the
system (i.e. source/monochromator/detector) response. Care must also be taken to ensure
that the fiber is not unduly bent, since the higher order modes show heavy bending losses
when they are near cut-off, and may not be able to propagate at all if the fiber is bent too
much. In fact a related method of determining $\lambda_c$ is to compare the transmission of the same
length of fiber between when it is straight and when it is bent. This technique obviates the
need to allow for system response.

FIG. 8.45 Diagram of the relative amount of power transmitted through a single mode fiber as a function
of wavelength. There is a sudden change in the transmission at the cut-off wavelength, $\lambda_c$.



8.6.5 Refractive index profile measurement 

Several methods are available for measuring the fiber core refractive index profile, none of 
which, however, is entirely satisfactory. Probably the most accurate method is the interfero- 
metric slab technique. In this a thin slice of fiber of thickness d is taken perpendicular to 
the fiber axis. The thickness must be uniform to within a fraction of the wavelength of the 
light used. Light passing through the slab will normally 'see' an optical thickness of n(r)d,
where n(r) is the refractive index a radial distance r from the fiber center. The slab is then
placed in the arm of an interferometric system (see section 6.6.1). The non-uniform optical
path differences through the slab then show up as distortions in the uniform fringe pattern
observed in the absence of the slab. Using this technique, the refractive index can be measured
to better than one part in $10^4$, the main limitation being the accuracy with which the
thickness of the slab can be measured. The disadvantages are that it requires precise and time- 
consuming sample preparation and it is also, of course, destructive to the fiber. 

A non-destructive technique capable of yielding reasonably accurate results is the near- 
field scanning method. If all the guided modes are uniformly excited then the variation of 
light irradiance across the end of the fiber may be written (ref. 8.21) as 



$I(r) = I(0)\left[\dfrac{n_1^2(r) - n_2^2}{n_1^2(0) - n_2^2}\right]$ (8.45)

Here I(r) is the irradiance observed a distance r from the center, and $n_1(r)$ and $n_2$ are the
core and cladding refractive indices respectively. If any leaky modes are present, however,
a correction factor has to be applied (ref. 8.22). The apparatus requirements are reasonably
straightforward. One end of the fiber is illuminated by focusing a Lambertian source (e.g.
a tungsten filament lamp) onto it. This ensures that the guided modes are all excited equally.
A magnified image of the other end of the fiber is formed in the plane of a small area photodiode,
which is then scanned across the image allowing I(r) to be determined.
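Once I(r) has been scanned, the index profile follows by inverting the near-field relation. The sketch below assumes that relation in the form $I(r)/I(0) = [n_1^2(r) - n_2^2]/[n_1^2(0) - n_2^2]$, ignores the leaky mode correction, and uses hypothetical index values; the axial index $n_1(0)$ and cladding index $n_2$ must be known from elsewhere.

```python
import math

def index_from_near_field(relative_irradiance, n_axis, n_cladding):
    """Core index n1(r) from the normalized near-field irradiance I(r)/I(0).

    Assumes uniform excitation of all guided modes and no leaky-mode
    contribution.
    """
    if not 0.0 <= relative_irradiance <= 1.0:
        raise ValueError("I(r)/I(0) must lie between 0 and 1")
    n_squared = (n_cladding ** 2
                 + relative_irradiance * (n_axis ** 2 - n_cladding ** 2))
    return math.sqrt(n_squared)

# Hypothetical graded index fiber: n1(0) = 1.46, n2 = 1.45.
for ratio in (1.0, 0.5, 0.0):
    n = index_from_near_field(ratio, 1.46, 1.45)
    print(f"I(r)/I(0) = {ratio:.1f} -> n1(r) = {n:.4f}")
```

Since the index differences involved are small, the irradiance is very nearly linear in $n_1(r) - n_2$, so the scanned near-field pattern is almost a direct picture of the profile.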

The two methods described here are by no means the only ones that can be used to determine
refractive index profiles; other techniques will be found in ref. 8.23.

8.6.6 Optical time domain reflectometer

All the measurement techniques described hitherto are essentially laboratory based and would 
be difficult to implement in the field. Furthermore, they often require access to both ends 
of the fiber, a situation causing not a little difficulty if the fiber ends are several kilometres 
apart! One instrument capable of yielding considerable information about an optical fiber
system, and yet which requires access to one end only of the fiber, is the optical time domain
reflectometer (OTDR).

The basis of this instrument is that a narrow pulse of radiation is launched into one end
of the fiber, and then the amount of radiation re-emerging from this same end is monitored
as a function of time after the initial launch. As the pulse travels down the fiber, it will suffer
attenuation so that after a length L of fiber the pulse energy will be given by E(L), where

$E(L) = E(0)\exp(-\alpha L)$

Here $\alpha$ is the absorption coefficient. At every point along the fiber, a certain portion, $S_R$
say, of the pulse energy will be lost by Rayleigh scattering. Of this a fraction, $f_g$ (see Problem
8.21), will enter guided modes propagating back down the fiber. The amount of energy scattered
back down the fiber may therefore be written $S_R f_g E(L)$. This backscattered radiation
will itself be subject to attenuation as it travels back to the launch end of the fiber. The energy
finally re-emerging owing to backscattering from a point a distance L into the fiber is given
by $E_b(L)$, where

$E_b(L) = S_R f_g E(L)\exp(-\alpha L)$

or

$E_b(L) = S_R f_g E(0)\exp(-2\alpha L)$

Thus

$\log_e E_b(L) = \log_e[S_R f_g E(0)] - 2\alpha L$ (8.46)

Now the time delay, t, between the initial launch of the pulse and the subsequent return of
the backscattered energy from L may be approximated by $t = 2Ln_1/c$, so that we may transform
eq. (8.46) to give

$\log_e E_b(L) = \log_e[S_R f_g E(0)] - \dfrac{\alpha c t}{n_1}$ (8.47)

Thus a plot of the logarithm of the returning energy as a function of time should yield a
straight line with gradient $-\alpha c/n_1$. If $n_1$ is known, $\alpha$ may be determined.

In this analysis, we have assumed a perfectly uniform fiber as far as $\alpha$, $S_R$ and $f_g$ are concerned.
In practice, $\alpha$ may well vary along the fiber and discontinuities, such as fiber joins,
will give local increases in the amount of backscattering. For example, a butt join will give
something like a 4% backscatter because of Fresnel reflection. These will show up on the
graph as 'spikes'. In addition, the power loss within a join or splice will lead to a corresponding
reduction in the signal backscattered from beyond it. The end of the fiber will be
marked by a Fresnel spike with the signal falling to zero immediately afterwards. If $\alpha$ does
in fact vary along the length of the fiber, the above graph will not be a straight line; however,
the local gradient should reflect the local value of $\alpha$. In commercial instruments, the returning
energy is measured in decibels and time is converted into distance along the fiber. Figure
8.46 shows a typical trace.
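The extraction of the attenuation coefficient from a backscatter trace can be sketched as a straight-line fit to the logarithm of the returning energy against time, the gradient being $-\alpha c/n_1$ for a uniform fiber. The function names, the core index of 1.46 and the synthetic trace are all our own assumptions.

```python
import math

C_VACUUM = 2.998e8  # speed of light in vacuum, m/s

def least_squares_slope(xs, ys):
    """Gradient of the best-fit straight line through the points (xs, ys)."""
    n = len(xs)
    x_mean, y_mean = sum(xs) / n, sum(ys) / n
    numerator = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    denominator = sum((x - x_mean) ** 2 for x in xs)
    return numerator / denominator

def attenuation_from_otdr(times_s, energies, n1):
    """Attenuation coefficient (per metre) from a uniform-fiber OTDR trace."""
    log_energy = [math.log(e) for e in energies]
    slope = least_squares_slope(times_s, log_energy)  # equals -alpha * c / n1
    return -slope * n1 / C_VACUUM

# Synthetic trace for a hypothetical fiber with alpha = 5e-4 per metre.
alpha_true, n1 = 5.0e-4, 1.46
distances_m = [0.0, 1000.0, 2000.0, 3000.0, 4000.0]
times = [2 * L * n1 / C_VACUUM for L in distances_m]      # round-trip delay
energies = [math.exp(-2 * alpha_true * L) for L in distances_m]

alpha_fit = attenuation_from_otdr(times, energies, n1)
loss_db_per_km = 10 * math.log10(math.e) * alpha_fit * 1000
print(f"alpha = {alpha_fit:.2e} per m ({loss_db_per_km:.2f} dB/km)")
```

On a real trace the fit would be restricted to a stretch between joins and Fresnel spikes, since only there is the straight-line behaviour expected.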

One of the main difficulties in realizing this instrument is that the amount of energy return- 
ing from the fiber is very small, and even under optimum conditions it is difficult to examine 
more than 15 km of fiber. The light source is invariably a pulsed semiconductor laser, whilst 
the returning radiation is measured with an avalanche photodiode. Both to launch light and 
to measure the returning light, some form of beam splitter arrangement is required. Usually, 
the instrument sends out a continuous stream of pulses. After each of these, the amount of
energy returning after a certain (variable) time delay is measured. The time delay is gradually
increased, thus enabling the whole curve to be built up point by point. To increase accuracy,
several readings may be taken for each delay and an average taken. Typical pulse widths
are of the order of 50 ns with launched peak powers of a few hundred milliwatts. Features
on the curve can be located to within ±5 m, or perhaps a little better. Overall, a dynamic
range of something like 40 dB (= 2 x 20 dB single way loss) can be tolerated over the length
of fiber under inspection before the returning signal becomes too weak to measure. 
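The figures quoted here are mutually consistent, as a rough check shows. A pulse of duration $\tau$ occupies a round-trip length of fiber $c\tau/2n_1$, which limits how precisely a feature can be located, and the two-way dynamic range fixes the longest fiber that can be examined for a given loss per kilometre. The core index of 1.46 and the fiber loss used below are assumed values, and the function names are our own.

```python
C_VACUUM = 2.998e8  # speed of light in vacuum, m/s

def spatial_resolution_m(pulse_width_s, n1):
    """Length of fiber corresponding to one pulse width on a round trip."""
    return C_VACUUM * pulse_width_s / (2.0 * n1)

def max_fiber_length_km(two_way_dynamic_range_db, loss_db_per_km):
    """Longest fiber whose round-trip loss fits within the dynamic range."""
    return (two_way_dynamic_range_db / 2.0) / loss_db_per_km

print(f"resolution ~ {spatial_resolution_m(50e-9, 1.46):.1f} m")
print(f"range ~ {max_fiber_length_km(40.0, 1.3):.1f} km at an assumed 1.3 dB/km")
```

A 50 ns pulse thus corresponds to about 5 m of fiber, in line with the location accuracy quoted, and a 40 dB range corresponds to roughly 15 km only if the fiber loss is a little over 1 dB per km.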




FIG. 8.46 Typical trace from an OTDR. The horizontal axis is distance along the fiber (km).

8.7 Fiber materials and manufacture

Only two main materials have been seriously considered to date for use in optical waveguides, 
these being various types of plastics and silica. Plastic fibers offer some advantages in terms 
of cost and ease of manufacture, but their high transmission losses preclude their use in anything
other than short distance (i.e. less than a few hundred meters) optical links.



8.7.1 Silica-based fibers

An obvious requirement in any type of fiber manufacture is that the core refractive index
must be higher than that of the cladding. Pure silica has a refractive index of about 1.45 at
1 µm and this may be changed by doping with various impurities. For example, impurities
that raise the refractive index are titania ($\mathrm{TiO_2}$), alumina ($\mathrm{Al_2O_3}$) and germania ($\mathrm{GeO_2}$), whilst
boria ($\mathrm{B_2O_3}$) and fluorine (F) lower it. The solubility of such materials in silica is limited
and the maximum change in refractive index is limited to a few percent. Some of the highest
performance fibers have been fabricated with germania-doped cores and fluorine-doped
claddings.

Most techniques currently used to manufacture silica fiber are based on some type of
vapour deposition technique. One of the most popular of these is the so-called modified chemical
vapour deposition (MCVD) method. In this a doped silica layer is deposited onto the
inner surface of a pure silica tube. The deposition occurs as a result of a chemical reaction
taking place between the vapour constituents that are being passed down the tube. Typical
vapours used are $\mathrm{SiCl_4}$, $\mathrm{GeCl_4}$ and $\mathrm{O_2}$ and the reactions that take place may be written

$\mathrm{SiCl_4 + O_2 \rightarrow SiO_2 + 2Cl_2}$

and

$\mathrm{GeCl_4 + O_2 \rightarrow GeO_2 + 2Cl_2}$

The zone where the reaction takes place is moved along the tube by locally heating the tube
to a temperature in the range 1200-1600°C with a traversing oxyhydrogen flame (Fig. 8.47).
If the process is repeated with different input concentrations of the dopant vapours, layers
of different impurity concentrations may be built up sequentially. Once the deposition process
is complete, any residual gases are pumped out and the tube is heated to its softening temperature
(~2000°C). Surface tension effects cause the tube to collapse into a solid rod called
a preform.

A fiber may subsequently be produced by drawing from the heated tip of the preform as
it is lowered into a furnace (Fig. 8.48). To exercise tight control over the fiber diameter, a
thickness monitoring gauge is used before the fiber is drawn onto the take-up drum, and feedback
is applied to the drum take-up speed. In addition, a protective plastic coating is often
applied to the outside of the fiber (see Fig. 8.48 and section 8.8) by passing it through a bath
of the plastic material: the resulting coating is then cured by passing it through a further
furnace.

In other vapour deposition techniques, the preform may be made by growing doped silica
layers on the outside of a silica rod. The reactant gases, such as $\mathrm{SiCl_4}$ and $\mathrm{GeCl_4}$, are fed
into $\mathrm{H_2/O_2}$ flames where a stream of fine doped silica soot particles is formed. In the outside
vapour deposition (OVD) technique, these are deposited on the outside of a rotating mandrel,
the growth taking place radially. Alternatively, as in the vapour axial deposition (VAD) technique,
deposition may take place on the end surface of a silica 'seed' rod. Both of these deposition
techniques produce a porous material, and so a further production stage is required
involving heating the rod to between 1000°C and 1500°C in the presence of chlorine. The
solid preform thus formed may be drawn into a fiber in exactly the same way as described
above for the MCVD technique.

FIG. 8.47 Production of fiber preform by modified chemical vapour deposition. In the first stage (a) the
reactants are introduced into one end of a silica tube and the core material deposited on the inside of the
tube in the reaction zone, where the temperature is maintained at about 1600°C. Several traverses of the
heating assembly may be necessary to build up sufficient thickness of core material. In the second stage (b)
the tube is collapsed into a solid preform rod by heating to the silica softening temperature (about 2000°C).

FIG. 8.48 Conversion of a preform rod into a fiber. The end of the rod is heated to its softening point and
fiber drawn off from the tip and wound onto a take-up drum. The fiber is usually coated with a layer of
plastic for protection before being wound on the drum.

8.7.2 Plastic-coated silica fiber

Fibers are available which have a silica core combined with a plastic cladding. These plastic-coated
silica (PCS) fibers are somewhat easier to manufacture than all-silica fiber. A preform
is made consisting of the core material only. The preform to fiber process then proceeds as
above, the difference now being that the plastic bath provides the cladding material rather
than a protective coating. The process readily lends itself to the production of step index multimode
fibers with relatively large core diameters where little of the energy is carried in the
cladding. Such fibers are attractive for medium distance, moderate bandwidth communication
systems, where cost is a major consideration. Typical losses are of the order of

10 dB km$^{-1}$. These values are significantly higher than those in all-silica fibers because,
although very little of the energy travels in the cladding, that which does is subject to much
greater attenuation.

8.7.3 All-plastic fibers

Fibers can also be made entirely from plastic materials, a popular combination being a
core of polymethyl methacrylate (PMMA, n = 1.495) and a cladding of fluoroalkyl
methacrylate (n = 1.402). Compared with silica fibers losses in such fiber are high (Fig.
8.49) but the basic causes are the same: there are strong absorption bands associated with 
vibrations of the C-H bond and also high Rayleigh scattering from the long chain mol- 
ecules. It will be seen from Fig. 8.49 that transmission windows exist at 570 nm and 
650 nm. Although attenuation is in fact lower for the former wavelength, plastic fibers 
are almost always used in conjunction with red-emitting LEDs. This reflects the fact that 
these have a higher output and a superior frequency response than green LEDs. The ulti- 
mate attenuation in plastic fibers is not known with any great certainty, although values 
as low as 10 dB km⁻¹ have been estimated. Considerable effort is being put into lowering
the attenuation in these fibers (ref. 8.24). 

In spite of their (current) high attenuation all-plastic fibers do offer some significant advan- 
tages when used over distances of a few hundred metres or so. Because of their increased 
flexibility they can be made with diameters considerably greater than silica fibers (up to 
3 mm, although 1 mm is more normal). This makes coupling fibers together much easier, 
and simple injection moulded connectors can be used. Fiber end preparation is also very 
straightforward: simple cutting with a sharp blade is usually adequate. In addition the rela- 
tively large differences between core and cladding refractive indices give rise to high numer- 
ical apertures (typically 0.5), which makes it easy to couple light in from sources such as 
LEDs. Techniques have also been developed for making graded index plastic fibers which 
offer the possibility of very high bandwidths over short distances, but these are not, at the 
time of writing, available commercially. 
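
The numerical aperture quoted above can be checked against the PMMA and fluoroalkyl methacrylate indices given earlier in this section. The following is a minimal sketch, assuming the usual step index relation NA = (n₁² - n₂²)^(1/2) and launch from air; the function names are illustrative only.

```python
import math


def numerical_aperture(n_core: float, n_clad: float) -> float:
    """Step index numerical aperture, NA = sqrt(n_core^2 - n_clad^2)."""
    return math.sqrt(n_core**2 - n_clad**2)


def acceptance_angle_deg(na: float, n_outside: float = 1.0) -> float:
    """Half-angle of the acceptance cone in a medium of index n_outside."""
    return math.degrees(math.asin(na / n_outside))


# Indices quoted in the text for a PMMA core and fluoroalkyl methacrylate cladding.
na_plastic = numerical_aperture(1.495, 1.402)
angle_plastic = acceptance_angle_deg(na_plastic)  # launch from air assumed
print(f"NA = {na_plastic:.2f}, acceptance half-angle = {angle_plastic:.1f} degrees")
```

The result is close to the "typically 0.5" figure quoted in the text, which is why LEDs couple so readily into these fibers.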

One of the problems with PMMA-based fiber is that it can only be operated in environ- 
ments below about 70°C. This precludes its use in, for example, the automobile engine for 
monitoring purposes etc. Other alternatives, for example polycarbonates, are being
investigated, although these usually exhibit higher attenuation.

FIG. 8.49 Typical attenuation versus wavelength plot for an all-plastic fiber.

8.7.4 Mid-infrared fibers

The performance of silica-based fibers has been dramatically improved over the last few years
to the stage where, over a wide wavelength range, the theoretical attenuation limit given by
Rayleigh scattering has, to all intents and purposes, been reached. As we have seen, the lowest
attenuation (about 0.2 dB km⁻¹) occurs at a wavelength of 1.55 μm. This is a result of the
presence of Rayleigh scattering, which declines with increasing wavelength (as λ⁻⁴), and
vibronic absorption, which produces a sudden rapid increase beyond a particular wavelength.
If materials could be found where the vibronic absorption peak occurs further into the
infrared and where Rayleigh scattering is at least as low as in silica, then there would be the
possibility of obtaining fibers with even lower minimum attenuations than are possible at
present. For example, if the vibronic absorption in silica set in at 3.2 μm rather than at 1.6 μm,
then the minimum attenuation should be a factor 2⁴ or 16 smaller than in silica, that is about
0.2/16 or 0.0125 dB km⁻¹. Figure 8.50 shows the theoretical attenuations for a number of
promising materials.
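
The scaling argument above is simple enough to check numerically. The sketch below assumes only the λ⁻⁴ Rayleigh dependence stated in the text; the function name and argument names are illustrative.

```python
def rayleigh_limited_attenuation(alpha_ref_db_km: float,
                                 lambda_ref_um: float,
                                 lambda_new_um: float) -> float:
    """Scale a Rayleigh-limited attenuation from one wavelength to another.

    Assumes attenuation varies as wavelength**-4 and that no other loss
    mechanism intervenes between the two wavelengths.
    """
    return alpha_ref_db_km * (lambda_ref_um / lambda_new_um) ** 4


# Example from the text: absorption edge moved from 1.6 um to 3.2 um.
alpha_new = rayleigh_limited_attenuation(0.2, 1.6, 3.2)
print(f"Estimated minimum attenuation: {alpha_new:.4f} dB/km")
```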

Interest has centred mainly round two types of material, namely the heavy metal fluoride
systems and the chalcogenide glasses (ref. 8.25). One of the most studied examples of the
former is ZrF₄-BaF₂-LaF₃-AlF₃-NaF or 'ZBLAN'. So far minimum attenuations of between
1 dB km⁻¹ and 2 dB km⁻¹ have been obtained at a wavelength of 2.5 μm. These are about two
orders of magnitude larger than the theoretical minimum: the main problem would seem to
be scattering from submicrometre scattering centres such as clumps of impurities and bubbles.

Chalcogenide glasses consist of compounds formed between one of the elements Ge, Si,
As, Sb and one of the elements S, Se, Te, an example being As₂S₃. These offer the prospect
of low attenuation at yet higher wavelengths (up to, say, 10 μm). Here again the actual
transmission losses obtained in practice are orders of magnitude higher than the theoretical values.
In spite of a sustained worldwide research effort, it has not been possible so far to find any
better material for optical fibers than silica. Current opinion would seem to be that as far as
long distance communication links are concerned we will have to make do with silica for
many more years to come. It should also be pointed out that if such research is successful,
then new detector and emitter materials would have to be developed.

Not all uses of fiber, however, require long lengths; there are many instances where trans- 
mission of mid-infrared radiation over just a few metres is all that is needed. One example 
of this is the use of the CO₂ laser in surgery where the availability of a flexible beam
delivery system such as the optical fiber is extremely useful (see section 6.8.2 and ref. 8.26).

8.7.5 Special fiber types 

8.7.5.1 Polarization-maintaining fiber 

Although we frequently talk of single mode fibers, in fact two orthogonally polarized modes 
can usually propagate equally well down ordinary single mode fiber. In a circularly sym- 
metrical fiber, these two modes will have identical propagation velocities. However, real 
fibers will not be perfectly symmetrical; there are bound to be slight anisotropies in both
shape and refractive index. The result is that the two modes will inevitably have slightly dif- 
fering velocities. In other words, the fiber is to some extent birefringent (section 3.2). The 
birefringence may result either from the intrinsic properties of the fiber (intrinsic birefrin- 
gence) or from external perturbations such as bending or squeezing (extrinsic birefringence). 

Let us suppose that we launch equal amounts of energy into the two orthogonally polarized
modes of a single mode fiber. As the modes travel along the fiber their phase relationship
will change (Fig. 8.51), but after a certain length (called the beat length) the phase
difference reaches $2\pi$ and the original input polarization state recurs.

Suppose the velocities of the two modes can be written as $c/n_x$ and $c/n_y$ where $n_x$ and
$n_y$ are the effective refractive indices for the two modes. After a distance $L$, the phase
difference between the modes, $\Delta\phi$, is given by

$$\Delta\phi = (n_x - n_y)\,\frac{2\pi L}{\lambda_0}$$

The beat length $L_p$ is then given by the condition that $\Delta\phi = 2\pi$; hence

$$L_p = \frac{\lambda_0}{n_x - n_y} \tag{8.48}$$

Typical 'circular'-cored single mode fibers have beat lengths in the range 100 mm < $L_p$ < 5 m.
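
To get a feel for the size of birefringence implied by eq. (8.48), the beat length relation can be evaluated directly. This is a minimal sketch; the operating wavelength of 1.3 μm is an assumption made purely for illustration (the text does not specify one), and the function names are hypothetical.

```python
def beat_length(wavelength_m: float, delta_n: float) -> float:
    """Beat length from eq. (8.48): L_p = lambda_0 / |n_x - n_y|."""
    return wavelength_m / abs(delta_n)


def birefringence_from_beat_length(wavelength_m: float, beat_length_m: float) -> float:
    """Invert eq. (8.48) to obtain |n_x - n_y| from a measured beat length."""
    return wavelength_m / beat_length_m


WAVELENGTH = 1.3e-6  # metres; assumed value, not taken from the text

# The range of beat lengths quoted for ordinary 'circular'-cored fibers.
for lp in (0.1, 5.0):
    dn = birefringence_from_beat_length(WAVELENGTH, lp)
    print(f"L_p = {lp} m corresponds to |n_x - n_y| = {dn:.2e}")
```

Under this assumed wavelength, the quoted range of beat lengths corresponds to index differences of roughly 10⁻⁵ down to a few times 10⁻⁷, which illustrates how small the residual birefringence of ordinary fiber is.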
Now, modes that have very similar propagation constants can be coupled together by fiber
perturbations which have relatively long periodicities (see eq. 8.36 and subsequent
discussion). Such perturbations are difficult to avoid and hence coupling between the two
orthogonally polarized modes becomes a likely possibility. Thus, if one polarization state is
launched into one end of a fiber, mode coupling will ensure that some of this mode energy
will be coupled into the orthogonal mode. However, if the fiber has a large birefringence,
the two polarization modes can be coupled only by very short period perturbations, which
are much less likely to be present. Such fibers are known as 'Hi-Bi' fibers and are capable
of maintaining the initial linear polarization state over quite large distances.

FIG. 8.51 Fiber birefringence causes a phase shift ($\Delta\phi = (2\pi/\lambda_0)(n_x - n_y)z$) to
develop between the mode polarized along the y direction (heavy shading) and that polarized along
the x direction (light shading) as the modes travel along the fiber in the z direction.

The quality of a polarization-maintaining fiber may be judged by the value of the
polarization-holding parameter ($h$), which is defined as the amount of polarization mode coupling
per unit length. Thus if $P_x(z)$ and $P_y(z)$ represent the two mode powers at a point $z$, then

$$\frac{\mathrm{d}P_x(z)}{\mathrm{d}z} = h\,P_y(z)$$

Assuming that $h$ is small, $P_y(z)$ will change little with $z$, and so $P_x(z) = hzP_y(z)$. Fibers have
been fabricated with $h$ values as small as $5 \times 10^{-5}$ m⁻¹. This implies, for example, that after
1 km, and assuming $P_x(0) = 0$, $P_x/P_y = 10^3 \times 5 \times 10^{-5}$, which is $5 \times 10^{-2}$ or -13 dB in optical
power terms.
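
The arithmetic in the example above can be reproduced in a few lines. This sketch assumes the small-coupling approximation used in the text (the power ratio is simply h times the length) and is only meaningful while that product is much less than one; the function name is illustrative.

```python
import math


def polarization_crosstalk_db(h_per_m: float, length_m: float) -> float:
    """Power ratio P_x/P_y in dB after length_m, assuming h * length << 1
    and no power in the x mode at launch."""
    ratio = h_per_m * length_m
    if not 0.0 < ratio < 1.0:
        raise ValueError("small-coupling approximation requires 0 < h*z < 1")
    return 10.0 * math.log10(ratio)


# Values quoted in the text: h = 5e-5 per metre over 1 km.
crosstalk = polarization_crosstalk_db(5e-5, 1000.0)
print(f"Crosstalk after 1 km: {crosstalk:.1f} dB")
```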

Most polarization-maintaining fibers are intrinsically birefringent and several different
techniques have been used to produce them. All rely on producing a fiber cross-section which
is in some way asymmetric about two perpendicular axes. Usually, this asymmetry is built
into the fiber preform before the fiber is drawn. Some examples are shown in Fig. 8.52.
Elliptically cored fibers (Fig. 8.52a) have been made with beat lengths of about 1 m, but they
are difficult to manufacture. Most Hi-Bi fibers are made by incorporating a localized stress
region into the fiber. Thus an elliptical cladding (Fig. 8.52b) produces antisymmetric stress
across the core which leads to birefringence. Other examples are the 'bow tie' fiber (Fig.
8.52c), which is fabricated using a gas phase etching process during manufacture of the
preform, to produce stress regions round the core, and the 'side pit' or 'panda' fiber (Fig.
8.52d), which incorporates borosilicate rods in the cladding.

FIG. 8.52 Various types of birefringent fiber: (a) shows an elliptically cored fiber, and the
others all induce stress within a circular core; (b) has an elliptical cladding whilst (c) and (d)
have stress forming regions within the cladding.

It is also possible to design a fiber such that only one of the orthogonal modes can
propagate at the design wavelength. This is done by choosing the propagation constants so that
one of the modes is a guided mode whilst the other can only propagate as a leaky mode, and 
is accordingly highly attenuated. 



8.7.5.2 Bragg fiber gratings

When germania-doped fibers are exposed to ultraviolet radiation with a wavelength between
240 nm and 250 nm a change takes place in the refractive index of the fiber. The change is
relatively small (typically less than $10^{-4}$) but it is permanent, and can be increased by the
use of a technique called hydrogen loading. This involves forcing molecular hydrogen into
the fiber core by means of a high pressure, low temperature diffusion process. The resulting
refractive index changes can be used to create a phase grating, that is a periodic change
in the refractive index, within the core. There are a number of techniques for doing this: for
example, the fiber may be illuminated from the side by a coherent ultraviolet source which
is passed through a silica phase mask. This causes the beam to diffract into a number of
separate beams which can be used to 'write' the required pattern onto the core of the fiber, as
shown in Fig. 8.53. Typically the grating pattern is between 1 mm and 20 mm in length. Once
formed the fiber grating behaves in a very similar fashion to the Bragg acousto-optic grating
of section 3.8. Thus radiation of wavelength $\lambda_0$ which satisfies the equation

$$m\lambda_0 = 2n_1\Lambda \tag{8.49}$$

where $m$ is an integer, $\Lambda$ the periodicity of the grating and $n_1$ the refractive index of the core,
will undergo strong reflection when it encounters the grating. Such a structure is very useful
in a number of situations. It can provide a suitable mirror for use with fiber lasers,
it can be the basis of a wavelength-selective device in optical fiber communications
systems (section 9.3.8) and it can also serve as the basis for optical fiber temperature and
strain sensors (section 10.1.2).

FIG. 8.53 Use of a silica phase mask to form a phase grating in a fiber. The phase mask has a
wavelength which is twice that required. The two first-order beams from the diffracted UV beam
then produce a fiber grating having a periodicity of $\Lambda$, where $\Lambda = D/2$.
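
The Bragg condition of eq. (8.49) and the phase mask relation Λ = D/2 from Fig. 8.53 can be combined to estimate the mask period needed for a given reflection wavelength. The sketch below is illustrative only: the core index of 1.46 and the target wavelength of 1.55 μm are assumed values, not taken from the text, and the function names are hypothetical.

```python
def bragg_wavelength(n_core: float, grating_period_m: float, order: int = 1) -> float:
    """Reflected vacuum wavelength from m * lambda_0 = 2 * n_1 * Lambda."""
    return 2.0 * n_core * grating_period_m / order


def grating_period_for(wavelength_m: float, n_core: float, order: int = 1) -> float:
    """Grating periodicity Lambda needed to reflect a given vacuum wavelength."""
    return order * wavelength_m / (2.0 * n_core)


def phase_mask_period(grating_period_m: float) -> float:
    """Mask period D, taking Lambda = D / 2 as stated in the caption to Fig. 8.53."""
    return 2.0 * grating_period_m


N_CORE = 1.46         # assumed core index, for illustration only
TARGET = 1.55e-6      # assumed target wavelength in metres

period = grating_period_for(TARGET, N_CORE)
mask = phase_mask_period(period)
print(f"Grating period = {period * 1e6:.4f} um, mask period = {mask * 1e6:.4f} um")
```

With these assumed values the grating period comes out at roughly half a micrometre, so a grating 1 mm to 20 mm long contains several thousand to several tens of thousands of periods.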



Fiber cables 



The tensile strength of glass and silica fibers, when freshly made, is very high and indeed
compares favourably with that of steel. However, surface damage caused by handling or even
atmospheric attack rapidly leads to a decrease in strength. This effect may be greatly reduced
by adding a coating layer to the fiber immediately after manufacture. The material used must
provide a good chemical and physical barrier and yet must be fairly readily removable for
the purpose of jointing. The most widely used material is Kynar, a vinylidene fluoride polymer
commonly used as an electrical insulator. Although this primary coating helps preserve the
intrinsically high tensile strength of the fiber, it cannot protect it from major mechanical
damage. Consequently when fiber is used in any but a laboratory environment, it is
invariably contained in some form of cable structure. Care must be taken to avoid the occurrence
of microbends, since, as we saw in section 8.4.1, they can lead to significant additional fiber
losses of the order of 1 or 2 dB km⁻¹.

In many cable designs, the fiber is given a further coating or contained within a tight-fitting 
tube to make the fibers easy to handle during cable manufacture. Care is needed to ensure 
that differential thermal expansion or ageing processes do not again give rise to microbends 
or place undue stress on the fiber. The most successful materials for this have been relatively 
hard ones such as nylon, polypropylene or polyurethane. 



FIG. 8.54 Typical cable designs. In (a) the coated fibers are wrapped helically round a central
strengthening member, while (b) shows the BICC 'PSB' design where the bare fiber lies loosely
within a cavity in the cable. Labelled parts: polymer sheath, steel strengthening member, coated
fiber, cushion layers, fiber in cavity.

When incorporated into the main cable structure, the fibers are often wrapped helically
round a central strengthening member and contained within a polymer sheath (Fig. 8.54a).
Designs have also been proposed where the second fiber coating is dispensed with, and the
fiber lies loosely in a cavity within the cable structure, an example of this being the BICC
'PSB' design shown in Fig. 8.54(b).

Steps are usually taken to exclude water from the cable. This may be done by incorporating
an axially laid aluminium foil/polyethylene laminated film immediately inside the outer
sheath. Alternatively the spaces within the cable may be filled with a moisture-resistant
silicone rubber or petroleum-based compounds. The exclusion of water is important in that it
has been found that contact with water causes an increase in attenuation in silica fibers. When
contact occurs diffusion of hydrogen into the fiber takes place. It either moves into
interstitial spaces in the structure or reacts with the fiber dopants to form P-OH, Ge-OH or
Si-OH. Increases in fiber absorption of 5 dB km⁻¹ or more can result. Further details on fiber
cable design are contained in ref. 8.27.



NOTES

It is perhaps unfortunate that the commonly used symbol for this quantity should be
identical to that for voltage, but there should be no confusion in the context in which it is
used here.

In fact, there will be two modes, TE and TM.

It will be noted that we have only considered the case where θ < 45° in Fig. 8.10. It is
left as an exercise for the reader to draw an appropriate diagram for the case of θ > 45°
and to show that the corresponding phase difference between the intersecting beams is
the same as in the θ < 45° case.

In circular waveguides the core field amplitudes are written in terms of Bessel functions;
the number 2.405 represents the first zero of the $J_0(x)$ Bessel function. We may note that
the equivalent condition for planar waveguides is $V = \pi/2$. Significantly this represents
the first zero of the cosine function, which is just the function which describes the core
field amplitudes in such guides.
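
The two cut-off values mentioned in the last note can be confirmed numerically. The following sketch is not part of the original derivation; it evaluates J₀ from its power series and locates the first zero of each function by bisection, using only the Python standard library. The function names and bracketing intervals are illustrative choices.

```python
import math


def bessel_j0(x: float, terms: int = 40) -> float:
    """J0(x) from the power series sum_k (-1)^k (x/2)^(2k) / (k!)^2."""
    total = 0.0
    term = 1.0  # the k = 0 term
    for k in range(terms):
        total += term
        # Ratio of successive terms: -(x/2)^2 / (k+1)^2
        term *= -((x / 2.0) ** 2) / ((k + 1) ** 2)
    return total


def bisect_root(f, lo: float, hi: float, tol: float = 1e-10) -> float:
    """Locate a sign change of f in [lo, hi] by repeated halving."""
    if f(lo) * f(hi) > 0:
        raise ValueError("interval does not bracket a root")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(lo) * f(mid) <= 0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


first_zero_j0 = bisect_root(bessel_j0, 2.0, 3.0)   # circular guide cut-off value of V
first_zero_cos = bisect_root(math.cos, 1.0, 2.0)   # planar guide cut-off value of V
print(f"First zero of J0: {first_zero_j0:.4f}")
print(f"First zero of cos: {first_zero_cos:.4f}")
```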
PROBLEMS

8.1 Calculate the critical angle for a water/air interface. The refractive index of water is
1.33.

8.2 Verify that the phase change δ on reflection at an interface for radiation polarized with
its electric vector parallel to the plane of incidence is given by eq. (8.6a).

8.3 Calculate the phase shifts introduced on reflection inside a planar waveguide where
$n_1 = 1.48$, $n_2 = 1.46$ and the angle of incidence is 85°.

8.4 Light of vacuum wavelength 0.85 μm travelling in a medium of refractive index 1.48
is incident upon an interface with another medium of refractive index 1.46 at an angle
of 82°. Calculate the relative amounts of power present at distances (a) 5 μm and (b)
50 μm into the second medium.

8.5 Sketch the graphs (to scale) required to determine the ray angles of the allowed modes
in a planar symmetric waveguide of thickness 4 μm where the core and cladding
refractive indices are $n_1 = 1.50$ and $n_2 = 1$ respectively and where radiation of wavelength
1 μm is being used.

8.6 Estimate the attenuation of a 'non-guided' ray down a planar dielectric waveguide
where $n_1 = 1.48$, $n_2 = 1.46$ and $d = 200$ μm. Assume that the ray has an internal angle
of 0.1° less than the critical angle and that the electric field is perpendicular to the plane
of incidence.

8.7 The output from a semiconductor laser is usually linearly polarized with the electric
field vector in the plane of the junction. Explain why this should be. (Hint: consider
the relative phase shifts expected on reflection (Fig. 8.2) and the resulting internal beam
angles together with the consequent spread of the mode energy into the cladding.)

8.8 A step index fiber has a numerical aperture of 0.16, a core refractive index of 1.47
and a core diameter of 200 μm. Calculate (a) the acceptance angle of the fiber, (b) the
refractive index of the cladding and (c) the approximate maximum number of modes
with a wavelength of 0.85 μm that the fiber can carry.

If the fiber is immersed in water (refractive index 1.33) will the acceptance angle
change?

8.9 Estimate the intermodal dispersion expected for the fiber in Problem 8.8.

8.10 Calculate the maximum core diameter needed for a circular dielectric waveguide
(refractive index 1.5) to support a single mode of radiation with wavelength 1 μm when
(a) the waveguide has no cladding and (b) the waveguide is of the step index type with
Δ = 0.01.

Comment on the values obtained from the point of view of manufacturing single
mode waveguides.

8.11 A laser operating at 1.5 μm with a linewidth of 2 nm is used in conjunction with a single
mode silica-based fiber. Estimate the material dispersion expected.

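8.12 Show that when the source linewidth $\Delta\lambda_0$ is exactly centred on the wavelength
($= \lambda_m$) where $\lambda_0^2\,\mathrm{d}^2n/\mathrm{d}\lambda_0^2 = 0$, then the resulting material dispersion can be written

$$\Delta\tau = \frac{L}{8c}\,(\Delta\lambda_0)^2\left(\frac{\mathrm{d}^2N_g}{\mathrm{d}\lambda_0^2}\right)_{\lambda_m}$$

(Hint: the simplest way of proceeding is via the group index $N_g$ (eq. 8.32); at $\lambda_m$, $N_g$
will have a minimum value and hence about $\lambda_m$ will have a $(\lambda_0 - \lambda_m)^2$ dependence on
$\lambda_0$. Thus we may write $N_g(\lambda_0) - N_g(\lambda_m) = [(\lambda_0 - \lambda_m)^2/2!]\,(\mathrm{d}^2N_g/\mathrm{d}\lambda_0^2)_{\lambda_m}$.)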

8.13 Consider a waveguide bent into an arc of a circle of radius $R$. Determine the point in
the cladding at which the phase velocity of a guided mode equals the phase velocity
of a plane wave in the cladding. Assume that the mode phase velocity is given by $c/n_1$.
Hence show that the rate of energy loss is proportional to $\exp(-R/R_0)$ where $R_0$ is a
constant. Assume that the mode electric field intensity declines exponentially with
distance into the cladding.

8.14 An optical power of 1 mW is launched into an optical fiber of length 100 m. If the
power emerging from the other end is 0.3 mW, calculate the fiber attenuation (ignore
reflection losses).

8.15 In a particular ZBLAN type of fiber the individual contributions to the fiber attenuation
were determined to be as follows:

Rayleigh scattering: $0.9 + 0.67\lambda^{-4}$ dB km⁻¹

tail from phonon absorption peak: $2 \times 10^{-3}\exp(3/\lambda)$ dB km⁻¹

Determine the minimum attenuation achieved and the wavelength at which it occurs.

8.16 Show that the relationship between the units dB km⁻¹ and absorption coefficient α (in m⁻¹)
is given by:

dB km⁻¹ = 4343α

8.17 Two identical multimode fibers (of core radius $a$) have their end faces in contact but
their core centres offset by a distance $D$. By ignoring Fresnel losses and assuming that
the energy within each fiber is uniformly distributed across the fiber core, show that the
fractional energy transmission between the fibers is given by eq. (8.40).

If the fibers have no offset displacement, but instead are separated by a longitudinal
displacement $S$, show that the fractional coupling loss can be written as
$a^2/(a + S\tan\alpha_{\max})^2$, where it has been assumed that radiation emerges from the fiber
end with a uniform angular distribution up to the fiber acceptance angle, $\alpha_{\max}$.

In both cases sketch your results in the form of dB loss versus normalized
misalignment (i.e. $S/a$ and $D/a$) and compare with the practical results shown in Fig.
8.35 (you will have to decide on a suitable value for $\alpha_{\max}$).

8.18 In an experiment to determine fiber scattering loss, a fiber is passed diagonally through
a cube whose sides are lined with Si photodetectors. Light from an He-Ne laser
(λ = 633 nm) is coupled into one end of the fiber and 4 mW of power are found to be
emerging from the other (a short length is used). If the cube side length is 30 mm and
the short circuit current obtained from all the cells connected in parallel is 100 nA,
estimate the scattering loss in the fiber (in dB km⁻¹). You will need to assume a value
for the quantum efficiency of the detectors.

8.19 A Gaussian pulse with σ = 2 ns is launched into a fiber 10 km long. At the other end,
the pulse is still Gaussian but with σ = 10 ns. Estimate the (electrical) bandwidth of
the fiber.

8.20 The refractive index profile of a graded index fiber is to be examined using the
near-field scanning technique. Draw a schematic diagram of the apparatus required and show
explicitly how the profile parameter may be obtained from the results.
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8.21 When scattering takes place within a fiber, some of the scattered radiation will enter 
guided modes and travel back down the fiber. Assuming isotropic scattering within a 
step index fiber of numerical aperture NA and cladding refractive index $n_2$, show that
the fraction of the scattered radiation so captured is given, approximately, by

$(\mathrm{NA}/2n_2)^2$.

8.22 Deduce all you can about the fiber whose OTDR trace is shown in Fig. 8.46. 

8.23 It has been suggested that if a material could be found where the onset of the infrared 
lattice absorption bands is at a higher wavelength than in glass/silica fibers, then much 
lower ultimate fiber losses could be obtained. Estimate from Fig. 8.32 what the 
minimum absorption is likely to be at a wavelength of 4 μm, assuming Rayleigh
scattering is the dominant loss mechanism.
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Optical communication systems 



Although the benefits of an optically based communication system have long been recognized, 
there were two main obstacles which at first stood in the way of a successful implementation. 
First, until recently it was impossible to modulate and demodulate light at anything other than 
very low frequencies. Secondly, without some sort of guiding medium, atmospheric trans- 
mission was limited to line-of-sight communications and was subject to the vagaries of the 
weather. It was the development of the laser that initially aroused renewed interest in optical 
communications, but early systems still had to employ atmospheric transmission. However,
once the initial high absorption losses were reduced, the cladded dielectric waveguide became 
a practical proposition, and the problem of a suitable guiding medium was solved. Not all lasers 
are equally suitable for launching adequate power into these waveguides, and the semicon- 
ductor laser has proved the best in this respect. Interestingly, an incoherent source - the LED 
- is in many ways equally useful in multimode fiber systems, which brings home the point 
that at present the coherence properties of lasers are little used in optical communications. This 
is in spite of the fact that the techniques of heterodyne and homodyne mixing (section 9.1.1) 
do offer prospects of improved signal-to-noise ratios. At present we are still a long way from 
utilizing the full potential bandwidth; the highest modulation frequencies so far achieved are 
some 10 times smaller than those theoretically allowed by the carrier frequency. 

In the present chapter, we review current optical communication systems and give a brief 
indication of the directions in which they may develop. One such area of interest in this 
respect is that of integrated optics. Here, the aim is to miniaturize optical components such 
as sources, detectors, modulators and filters and to fabricate complete optical processing 
systems containing these items onto a single semiconductor chip. First of all, however, we 
discuss the types of modulation that are most applicable to optical communications. 



9.1 Modulation schemes

The term modulation describes the process of varying one of the parameters associated with 
a carrier wave to enable it to carry information. Variations in the amplitude, irradiance, fre- 
quency, phase and polarization can all be used for this purpose. When the carrier wave oscil- 
lates at optical frequencies, however, not all of these possibilities are equally suitable. For 
example, most optical detectors respond only to the irradiance of the light, and so modula- 
tion of anything other than the irradiance is not very useful unless special techniques, such 
as those discussed in the next section, are used.

There are many different methods for transferring information into variations of the
appropriate wave parameter, and we divide these into three categories, namely analog, pulse and
digital. With analog modulation, the primary information signal, which we take to be a
time-varying electrical voltage, continuously varies the appropriate wave parameter. Thus, at any
one time there is a one-to-one relationship between the original signal amplitude and the
magnitude of the wave parameter. In both of the other two methods, the signal amplitude is only
sampled at regular intervals and this information is then conveyed by means of a series of
'pulses'. In pulse modulation, the pulse width may be varied in proportion to the required
signal; alternatively, a fixed-width pulse may be used and its time of occurrence within a
fixed time slot used for the same purpose. There are also several other possibilities. In digital
modulation, the information is provided by using a string of pulses whose timing and widths
are both fixed but whose amplitudes are restricted to certain 'quantized' values. Figures
9.1(a)-(c) illustrate these techniques.

Of these three categories, the most common are analog and digital, and it is to these that
we now direct our attention.
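
The difference between the pulse and digital schemes described above may be easier to see in a small numerical sketch. The code below is illustrative only: it assumes samples lying between zero and a chosen full-scale value, a linear mapping from sample to pulse width or pulse delay, and a straightforward binary word for the digital case. None of the function names or parameter choices come from the text.

```python
def pulse_widths(samples, slot, full_scale):
    """Pulse width scheme: one pulse per time slot, width proportional to the sample."""
    return [slot * s / full_scale for s in samples]


def pulse_delays(samples, slot, pulse_width, full_scale):
    """Pulse position scheme: a fixed-width pulse whose start time within
    the slot is delayed in proportion to the sample (assumed linear mapping)."""
    return [(slot - pulse_width) * s / full_scale for s in samples]


def binary_words(samples, full_scale, n_bits):
    """Digital scheme: quantize each sample to one of 2**n_bits levels and
    express the level as a fixed-length string of 'one' and 'zero' pulses."""
    levels = 2 ** n_bits
    words = []
    for s in samples:
        level = min(int(s / full_scale * levels), levels - 1)
        words.append(format(level, f"0{n_bits}b"))
    return words


samples = [0.0, 0.5, 1.0]          # signal sampled at three successive time slots
widths = pulse_widths(samples, slot=1.0, full_scale=1.0)
delays = pulse_delays(samples, slot=1.0, pulse_width=0.2, full_scale=1.0)
words = binary_words(samples, full_scale=1.0, n_bits=3)
print(widths, delays, words)
```

In the first two schemes the information is carried by a continuously variable timing quantity, whereas in the digital case only the presence or absence of each pulse matters, which is what makes it tolerant of amplitude distortion.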



9.1.1 Analog modulation

As we indicated above, not all of the wave parameters - amplitude, irradiance, frequency
and phase - are equally suitable for modulation purposes at optical frequencies. The main
difficulties arise in signal demodulation, since detectors respond only to the irradiance of
the radiation falling on them (see eq. 7.30).

We suppose that the magnitude of the electric field of the carrier wave can be written

$$\mathcal{E}_c(t) = A_c\cos(\omega_c t + \phi_c)$$

where $A_c$, $\omega_c$ and $\phi_c$ represent the carrier amplitude, angular frequency and phase
respectively, all of which may be modulated and hence be time dependent. If this signal is allowed
to fall directly onto a detector (in so-called direct detection), then the output $\mathcal{E}_d$ of a
detector which responds only to irradiance is given by

$$\mathcal{E}_d = RA_c^2\,\langle\cos^2(\omega_c t + \phi_c)\rangle \tag{9.1}$$

where $R$ is the detector responsivity and the angle brackets indicate an average taken over a
complete period of the function inside. Thus eq. (9.1) indicates that in direct detection only
the signal irradiance (which is proportional to the square of the amplitude) is recoverable,
and hence modulation of the other wave parameters is not possible. There are, however, other
techniques available which do enable further information to be extracted from the received
signal.
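
The time average appearing in eq. (9.1) can be evaluated explicitly. The short worked step below uses only the standard identity for the square of a cosine and assumes that the carrier amplitude and phase vary slowly compared with the optical period; $R$ and $A_c$ denote the detector responsivity and carrier amplitude as in the text.

```latex
\begin{aligned}
\langle \cos^2(\omega_c t + \phi_c) \rangle
  &= \tfrac{1}{2}\,\langle 1 + \cos(2\omega_c t + 2\phi_c) \rangle
   = \tfrac{1}{2}, \\
\text{detector output}
  &= \tfrac{1}{2}\,R A_c^2 .
\end{aligned}
```

The result contains neither the carrier frequency nor the phase, which is why only amplitude (irradiance) modulation survives direct detection.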

In heterodyne detection, the incoming signal is mixed with one from a local oscillator (Fig.
9.2), which we describe by

$$\mathcal{E}_o(t) = A_o\cos(\omega_o t + \phi_o)$$

where $\omega_o$ is chosen to be very close to $\omega_c$, while in contrast to $A_c$, $A_o$ will not, in general, be
time dependent. The output of the detector now becomes

$$\mathcal{E}_d = R\,\langle[\mathcal{E}_c(t) + \mathcal{E}_o(t)]^2\rangle
 = R\left[\frac{A_c^2}{2} + \frac{A_o^2}{2} + 2A_cA_o\,\langle\cos(\omega_c t + \phi_c)\cos(\omega_o t + \phi_o)\rangle\right] \tag{9.2}$$

Now

$$\langle\cos(\omega_c t + \phi_c)\cos(\omega_o t + \phi_o)\rangle
 = \tfrac{1}{2}\,\langle\cos[(\omega_c + \omega_o)t + (\phi_c + \phi_o)] + \cos[(\omega_c - \omega_o)t + (\phi_c - \phi_o)]\rangle$$

As $\omega_o$ is close to $\omega_c$, the term $\cos[(\omega_c - \omega_o)t + (\phi_c - \phi_o)]$ oscillates much more slowly than
the other and thus it may be separated out by inserting an electrical bandpass filter centred
on $(\omega_c - \omega_o)$ in the output of the detector (Fig. 9.2). The resulting output is then

$$\mathcal{E}_d = RA_cA_o\cos[(\omega_c - \omega_o)t + (\phi_c - \phi_o)] \tag{9.3}$$

and is now dependent on carrier amplitude, angular frequency and phase; hence each of these
parameters may be used to carry information.

FIG. 9.1 Three possible schemes for modulating an optical light beam. In (a) the amplitude (and
hence the irradiance) is continuously modulated in time. The envelope of the waveform gives the
required signal. In (b) the signal is sampled at the beginning of each 'time slot' and a pulse is
emitted during that time slot whose length is proportional to the signal amplitude. A digital
signal is shown in (c). Here, as in (b), the signal is sampled at regular intervals and then a
series of pulses is emitted which indicates (in binary notation) the signal amplitude. The
presence of a pulse during a time slot indicates a 'one', while the absence of a pulse indicates a
'zero'. It should be noted that for illustrative purposes the carrier frequency has been made
much smaller than would occur in practice. Panel titles: (a) Analog modulation; (c) Binary
digital modulation. Axes: irradiance against time.

FIG. 9.2 Schematic diagram of a heterodyne detection receiver. The incoming signal (frequency
$\omega_c$) is mixed with one from a local oscillator (frequency $\omega_o$) using a beam splitter arrangement.
The combined signals fall onto a photodetector and the electrical output of the latter is passed
through a bandpass filter centred on frequency $\omega_c - \omega_o$.
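
The appearance of a slowly varying beat term in the heterodyne detector output can be illustrated numerically. The sketch below is a rough model rather than a description of a real receiver: it averages the squared sum of the two fields over a short window (many carrier periods but a small fraction of a beat period) and compares the result with the slowly varying part of eq. (9.2). As in Fig. 9.1, the frequencies are made artificially low; all names and numerical values are assumptions chosen for illustration.

```python
import math


def windowed_detector_output(t_centre, a_c, a_o, f_c, f_o,
                             phi_c=0.0, phi_o=0.0,
                             responsivity=1.0, window=0.02, n=4000):
    """Average R * (E_c + E_o)^2 over a short window centred on t_centre.

    The window acts as a crude low-pass filter: it removes the terms that
    oscillate at about twice the carrier frequency but follows the beat.
    """
    total = 0.0
    for i in range(n):
        t = t_centre - window / 2 + window * (i + 0.5) / n
        e_c = a_c * math.cos(2 * math.pi * f_c * t + phi_c)
        e_o = a_o * math.cos(2 * math.pi * f_o * t + phi_o)
        total += (e_c + e_o) ** 2
    return responsivity * total / n


def predicted_output(t, a_c, a_o, f_c, f_o,
                     phi_c=0.0, phi_o=0.0, responsivity=1.0):
    """Slowly varying part of eq. (9.2): two constant terms plus the beat term."""
    beat = math.cos(2 * math.pi * (f_c - f_o) * t + (phi_c - phi_o))
    return responsivity * (a_c ** 2 / 2 + a_o ** 2 / 2 + a_c * a_o * beat)


# Assumed illustrative values: 1 kHz 'carrier', 998 Hz local oscillator (2 Hz beat).
params = dict(a_c=1.0, a_o=5.0, f_c=1000.0, f_o=998.0)
for t in (0.0, 0.125, 0.25):
    measured = windowed_detector_output(t, **params)
    expected = predicted_output(t, **params)
    print(f"t = {t:5.3f} s  averaged = {measured:7.3f}  predicted = {expected:7.3f}")
```

The averaged output follows the predicted beat to within the small error expected from the finite averaging window. A bandpass filter centred on the difference frequency would then remove the two constant terms, leaving the form given in eq. (9.3).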
In homodyne detection, the local oscillator is set to the same frequency as the carrier. In
this situation, eq. (9.2) becomes

$$
\langle\cos(\omega_c t + \phi_c)\cos(\omega_c t + \phi_o)\rangle
  = \tfrac{1}{2}\langle\cos[2\omega_c t + (\phi_c + \phi_o)] + \cos(\phi_c - \phi_o)\rangle
$$

Inserting a low frequency (i.e. much less than $\omega_c$) bandpass electrical filter after the detector
we may block terms other than $A_c^2/2$ and $A_cA_o\cos(\phi_c - \phi_o)$. If, in addition, $A_c \ll A_o$ then

$$
i_d = R A_c A_o \cos(\phi_c - \phi_o) \tag{9.4}
$$

Thus in homodyne detection both amplitude and phase modulation are possible. We also
see that for both heterodyne and homodyne detection the signal amplitude is a factor $A_o/A_c$
larger than for direct detection. When $A_o > A_c$ we have signal amplification, which enables
higher sensitivity to be obtained. Thus $A_o$ can be increased to the level where the receiver
noise is dominated by the shot noise contribution from the local oscillator (section 9.3.3.1).
In practice, improvements of some 1 to 20 dB in signal-to-noise ratios have been observed
over direct detection techniques (ref. 9.1). Coherent systems are not yet, however, in common
use mainly because of the additional complexity (and hence expense) required to achieve
the necessary frequency, phase and polarization stability. They will be discussed further in
section 9.3.9. For most of the rest of the chapter we will assume the use of direct detection.
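
The mixing argument behind eqs (9.1) to (9.3) can be checked numerically. The sketch below is only an illustration under stated assumptions: the detector is modelled as an ideal square-law device with unit responsivity ($R = 1$), the averaging window is chosen to be long compared with the carrier period but short compared with the beat period, and the frequencies are scaled down to audio values so that the sum runs quickly. The function names and all numerical values are hypothetical.

```python
import math

def square_law_average(t_mid, A_c, A_o, w_c, w_o, phi_c, phi_o, window, n=20000):
    """Average the squared total field over `window` seconds centred on t_mid.

    Mimics a square-law detector (responsivity R = 1, an assumption) that is
    too slow to follow the carrier but fast enough to follow the beat.
    """
    total = 0.0
    for k in range(n):
        t = t_mid - window / 2 + window * (k + 0.5) / n  # midpoint rule
        field = A_c * math.cos(w_c * t + phi_c) + A_o * math.cos(w_o * t + phi_o)
        total += field * field
    return total / n

def predicted_output(t, A_c, A_o, w_c, w_o, phi_c, phi_o):
    """Eq. (9.1) with only the difference-frequency term of eq. (9.2) retained."""
    beat = math.cos((w_c - w_o) * t + (phi_c - phi_o))
    return A_c ** 2 / 2 + A_o ** 2 / 2 + A_c * A_o * beat

# Illustrative, scaled-down values (not optical frequencies).
W_C = 2 * math.pi * 10_000.0   # "carrier" angular frequency
W_O = 2 * math.pi * 9_990.0    # "local oscillator", giving a 10 Hz beat
WINDOW = 0.002                 # 20 carrier periods, 1/50 of a beat period

if __name__ == "__main__":
    for t in (0.013, 0.05, 0.08):
        measured = square_law_average(t, 1.0, 10.0, W_C, W_O, 0.3, 1.1, WINDOW)
        print(t, round(measured, 3), round(predicted_output(t, 1.0, 10.0, W_C, W_O, 0.3, 1.1), 3))
```

The slowly varying part of the averaged output follows $A_cA_o\cos[(\omega_c - \omega_o)t + (\phi_c - \phi_o)]$, so a weak carrier is multiplied by the (large) local oscillator amplitude.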

9.1.2 Digital modulation 

In digital modulation the information carrier can assume one of a number of discrete states.
The simplest scheme is called two-level binary: in this there are only two such discrete states,
which are conveniently referred to as 'zero' and 'one'. Thus in amplitude modulation the
'one' state may be represented by a pulse which is greater than some predetermined amplitude
and the 'zero' state by a pulse that is less than this amplitude. The pulses are taken to
occur within fixed time slots. If the pulse width is less than that of the time slot it occupies,
then we refer to a return-to-zero (RZ) signal. Conversely, if the pulse fills the time slot, we
refer to a non-return-to-zero (NRZ) signal. Figure 9.3 illustrates these ideas.

FIG. 9.3 In a non-return-to-zero (NRZ) signal (a), each pulse exactly fills the time slot. In a return-to-zero
(RZ) signal (b), the pulse width is less than that of the time slot.

The complete process whereby an analog signal is converted into a digital-pulse-coded one is shown in
Fig. 9.4. The amplitude of the incoming signal is measured at discrete intervals and the result
converted into a binary number. For example, in Fig. 9.4(a) the signal amplitude at t = 2 ms
is 3 V; the number 3 in binary notation is 011 (we employ a three-digit binary number here
for simplicity; in practice, eight or more binary digits or bits are required). During the time
interval between this sampling event and the next, the information is transmitted as a series
of pulses. Thus, the signal heights at the five times indicated in Fig. 9.4(a) become converted
into the digital signal shown in Fig. 9.4(b).

FIG. 9.4 Transformation of an analog signal into a digital one. The analog voltage (a) is sampled every
millisecond and each measurement converted into a 3 bit two-level binary NRZ signal (b).

At first sight, this scheme seems somewhat wasteful since it requires a much higher system
frequency bandwidth than would be needed for the corresponding analog signal. To reproduce
a given signal, the sampling theorem (see ref. 9.2) tells us that we must sample at a rate that
is at least twice that of the highest frequency component in the signal. For example, if an 8
bit number is used for each amplitude measurement the bit rate required will be 16 times the
highest signal frequency. To reproduce each bit with reasonable accuracy, the system frequency
response must be at least equal to the bit rate. In telephone communication systems, for example,
the highest signal frequency is usually 4 kHz; a digital system would therefore require a bit
rate of some 4 × 16 kbps or 64 kbps and the frequency bandwidth would have to be greater
than this to enable the shape of the pulses to be reproduced with reasonable accuracy.

The great advantage of a digital system over an analog one is its relative freedom from
noise or distortion. Inevitably in any communication system the transmitted and received
signals will not be identical. Noise may be introduced and non-linearities in component
response will distort the signal. In analog systems, there is no means of rectifying this (at
least when using direct detection), but a digital signal can suffer severe distortion and still
be capable of accurately imparting the original information. In Fig. 9.5 we show a severely
degraded signal but, provided that during each time slot we can always make the correct decision
as to whether it contains a 'one' pulse or a 'zero', the original signal can be exactly
reproduced. The way in which this decision is usually reached is to set up a decision level.
If the signal exceeds this level at a particular time (the decision time) within each time slot,
then a 'one' is recorded; if it does not do so, then a 'zero' is recorded.
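
The coding and regeneration steps described in this section can be sketched in a few lines. This is a minimal illustration rather than a model of any real receiver: the sample values, the bounded noise model, the unit pulse height and all function names are assumptions introduced here. Only the 3 V to 011 conversion and the 4 kHz to 64 kbps arithmetic come from the text.

```python
import random

def pcm_encode(samples, bits=3):
    """Turn integer sample amplitudes into a flat list of bits, MSB first."""
    out = []
    for s in samples:
        if not 0 <= s < 2 ** bits:
            raise ValueError("sample outside the range of the code")
        out.extend((s >> shift) & 1 for shift in range(bits - 1, -1, -1))
    return out

def pcm_decode(bit_list, bits=3):
    """Group the bit stream into words and convert each back to an integer."""
    values = []
    for i in range(0, len(bit_list), bits):
        value = 0
        for b in bit_list[i:i + bits]:
            value = (value << 1) | b
        values.append(value)
    return values

def add_bounded_noise(bit_list, peak=0.4, seed=0):
    """Degrade ideal 0/1 pulse heights with noise of at most +/- peak (assumed model)."""
    rng = random.Random(seed)
    return [b + rng.uniform(-peak, peak) for b in bit_list]

def regenerate(received, decision_level=0.5):
    """Record a 'one' when the level at the decision time exceeds the decision level."""
    return [1 if level > decision_level else 0 for level in received]

def pcm_bit_rate(f_max_hz, bits_per_sample):
    """Sample at twice the highest signal frequency, then send every bit."""
    return 2 * f_max_hz * bits_per_sample

if __name__ == "__main__":
    samples = [1, 3, 6, 5, 2]                      # hypothetical sample heights in volts
    sent = pcm_encode(samples)
    received = add_bounded_noise(sent)
    print(pcm_decode(regenerate(received)))        # same list as `samples`
    print(pcm_bit_rate(4000, 8))                   # telephone channel: 64000 bps
```

Because the assumed noise never carries a pulse across the decision level, the regenerated bit stream is identical to the transmitted one; with unbounded (for example Gaussian) noise, occasional errors would occur.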

This does not mean, however, that digital signals are entirely error free. Inevitably the fluctuations
in signal level mean that there is always a possibility of making a mistake as to whether
a 'one' or 'zero' has been received. The bit-error rate indicates, for some average signal level,
the probability that a particular bit will be in error. If it is assumed that the signal amplitudes
exhibit a Gaussian distribution, and that the decision level is set halfway between the 'zero'
level and the 'one' level, then it can be shown (ref. 9.3) that the relationship between the bit-error
rate, BER, and the (electrical) signal-to-noise ratio in the signal, S/N, is given by

$$
\mathrm{BER} = \frac{1}{2}\,\mathrm{erfc}\!\left(\frac{(S/N)^{1/2}}{2\sqrt{2}}\right) \tag{9.5}
$$

where erfc is the complementary error function, given by

$$
\mathrm{erfc}(x) = 1 - \frac{2}{\sqrt{\pi}}\int_0^x \exp(-y^2)\,dy
$$

FIG. 9.5 Illustration showing how a severely degraded digital signal can still be restored to its original
condition.

A graph of the BER as a function of S/N ratio is shown in Fig. 9.6. A fairly standard BER
for telecommunication systems is $10^{-9}$: this is seen to require an S/N ratio of 21.6 dB (or
144).
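
The figures quoted above can be reproduced directly from eq. (9.5). The following is a small sketch using the complementary error function from the Python standard library; the function names are hypothetical.

```python
import math

def bit_error_rate(snr):
    """Eq. (9.5): BER for an electrical signal-to-noise ratio given as a plain ratio."""
    return 0.5 * math.erfc(math.sqrt(snr) / (2 * math.sqrt(2)))

def to_decibels(power_ratio):
    """Express a power ratio in decibels."""
    return 10 * math.log10(power_ratio)

if __name__ == "__main__":
    print(to_decibels(144))       # about 21.6 dB
    print(bit_error_rate(144))    # close to 1e-9
```

Evaluating the function over a range of S/N values gives the steep curve of Fig. 9.6: a change of only a decibel or so in S/N alters the BER by roughly an order of magnitude in this region.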

We have assumed in this discussion that the detector 'knows' the position and spacing
of the time slots and also the height of the decision level. Usually, however, the detector
can only determine this information from the signal itself. A standard sequence inserted
before the message proper begins helps, but both the pulse amplitude and the timing of the
pulses may very well 'wander' somewhat during the message, and the detector circuitry must
be able to maintain updated values of these parameters. Fortuitous groupings of pulses can
make this task difficult. For example, the decision level is usually determined by taking the
average amplitude of a sequence of received bits. If there are equal numbers of 'ones' and
'zeros', this procedure will be adequate. If, however, a fairly long sequence of the same bit
occurs, which is not as unlikely as it seems at first, then the decision level may change to
such an extent that errors become likely when a more even sequence subsequently appears.

To overcome these problems, it is customary to modify the message so that sequences do
contain a fairly even distribution of 'ones' and 'zeros'. This is known as line coding. Several
schemes for this are available (see ref. 9.4), most of which involve adding extra bits to the
signal and hence increasing the required system bandwidth still further. There is generally
a trade-off between coding complexity (and hence the increased bandwidth required) and
the accuracy of signal recovery. A simple example of such a coding scheme is that of coded
mark inversion. In this, an input of '0' is signalled as 01 and sequential 'ones' are signalled
alternately as 00 and 11. Long runs of the same bit are then avoided and some measure
of error detection is also possible since the sequence 10, when referring to an original bit,
would not occur. However, the number of bits required has now been increased by a factor
of two. Most line coding schemes adopted in practice increase the required bit rate by some

FIG. 9.6 Bit-error rate (BER) as a function of (electrical) signal-to-noise ratio as given by eq. (9.5).

FIG. 9.7 Illustration of (a) frequency shift keying (FSK) and (b) phase shift keying (PSK). A possible
interpretation of the signals in terms of 'ones' and 'zeros' is also shown.

In our discussion of digital modulation we have so far only considered amplitude modulation
(often referred to as amplitude shift keying or simply ASK). We saw in section 9.1.1
that using coherent detection it is possible to detect changes in both frequency and phase.
Thus digital modulation can also involve frequency shift keying (FSK) and phase shift keying
(PSK); these are illustrated in Figs 9.7(a) and 9.7(b).
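
The coded mark inversion scheme described earlier in this section is simple enough to sketch directly. The code below is an illustration only: the text does not say whether the first 'one' is sent as 00 or 11, so starting with 00 is an assumption, and the function names are hypothetical. The decoder checks only for the forbidden pair 10; it does not verify that successive 'ones' alternate.

```python
def cmi_encode(bits):
    """Encode '0' as 01 and successive 'ones' alternately as 00 and 11."""
    out = []
    mark = 0  # assumption: the first 'one' is sent as 00, the next as 11
    for b in bits:
        if b == 0:
            out += [0, 1]
        else:
            out += [mark, mark]
            mark ^= 1  # invert the mark for the next 'one'
    return out

def cmi_decode(code):
    """Decode pairs of line bits; the pair 10 signals a transmission error."""
    if len(code) % 2:
        raise ValueError("coded stream must contain an even number of bits")
    bits = []
    for i in range(0, len(code), 2):
        pair = (code[i], code[i + 1])
        if pair == (0, 1):
            bits.append(0)
        elif pair in ((0, 0), (1, 1)):
            bits.append(1)
        else:
            raise ValueError("pair 10 never occurs in a valid stream")
    return bits

if __name__ == "__main__":
    message = [0, 1, 1, 0, 1]
    line = cmi_encode(message)
    print(line)                 # [0, 1, 0, 0, 1, 1, 0, 1, 0, 0]
    print(cmi_decode(line))     # [0, 1, 1, 0, 1]
```

The encoded stream is exactly twice as long as the message, which is the factor-of-two bit rate penalty noted in the text.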



9.2 Free space communications

Although fiber optical communication has become increasingly important over the last few
years, free space communication over distances of up to a few kilometres is also a practical
possibility provided only fairly low bandwidths are required. Systems are easily set up
and costs are considerably less than if using fibers. The low beam divergence of a well-collimated
optical beam can have considerable security advantages over radio communications.
A typical application would be a voice or low data rate link between nearby buildings in an
urban environment.

An obvious disadvantage is that adverse atmospheric conditions, such as rain or snow,
may introduce severe distortion or even render the system completely inoperable. Even when
conditions are favourable, the signal will be attenuated by both absorption and scattering
processes in the atmosphere. The former arises from the presence of molecular constituents
such as water vapour, carbon dioxide and ozone, whose exact concentrations depend on many
variables such as temperature, pressure, geographical location, altitude and weather conditions.
Strong absorption occurs around wavelengths of 0.94 µm, 1.13 µm, 1.38 µm, 1.90 µm,
2.1 µm, 4.3 µm and 6.0 µm. Between these values lie the so-called atmospheric windows,
where losses are mainly determined by scattering. Scattering is primarily due to particles
that are large compared with the wavelength, such as smoke or fog, and this is known as
Mie scattering (this is in contrast to Rayleigh scattering where the particle size is much
smaller). The effective absorption coefficient in Mie scattering varies relatively little with
wavelength. A typical atmospheric absorption spectrum is shown in Fig. 9.8.

Another problem is that of atmospheric turbulence; heating of the air in contact with the
earth's surface creates convection currents in the atmosphere. Since the refractive index of
air is temperature dependent, we experience refractive index variations through the atmosphere
causing beam deviation and spreading. This can give rise to sudden and pronounced
fading of the signal. Under ideal conditions, however, ranges of up to 150 km are possible,
but to achieve reasonably large bandwidths, distances should be less than 2 km. Fluctuations
in absorption and scattering tend to modulate the signal strength in the range 1-200 Hz so
that amplitude modulation is not usually very satisfactory.

The simplest systems make use of LEDs and silicon photodiode detectors. These may be
coupled by suitable lens or mirror systems such as those shown in Fig. 9.9. More complex
systems have been proposed using sources such as Nd:YAG and CO₂ lasers where the inherently
small laser beam divergence greatly simplifies beam collimation. Modulation may be
achieved in a variety of ways. For example, a CO₂ laser operating in a single mode may be
frequency modulated by introducing an electro-optic crystal into the resonant cavity. A signal
voltage applied to the crystal changes its length and hence the optical length of the cavity.
This in turn causes the single mode resonance frequency to change. Alternatively, lasers such
as the Nd:YAG can be made to emit a train of pulses by the technique of mode locking (see
section 6.3). A signal can then be impressed on this pulse train using an external modulator.
Such systems as these, however, have been rarely used, primarily because of their complexity
and high cost. A possible future use is in deep space communications, where there
are no problems with transmission through an atmosphere and where the low beam divergences
are useful in view of the large distances involved.



9.3 Fiber optical communication systems

The advent of optical fibers with losses of less than 1 dB km⁻¹ and with high information-carrying
capacity (e.g. bandwidths > 100 MHz) has meant that they have become very attractive
alternatives to twisted wire or coaxial cables in many types of communication links.
The main problem with the latter is that their attenuation increases as the square root of the
frequency of the carrier. This arises because of the so-called skin effect, whereby, as frequency
increases, the oscillating electrons are confined to an increasingly thin annulus round
the outside of the metallic conductor. At a frequency of 100 MHz, for example, 9.5 mm diameter
coaxial cable has an attenuation of about 20 dB km⁻¹, significantly worse than the optical
losses of silica-based fiber even at a wavelength of 850 nm (see section 8.4).
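
The square-root dependence quoted above allows the coaxial cable figure to be scaled to other frequencies. The sketch below assumes that the attenuation is set purely by the skin effect over the whole range considered, and takes the 20 dB km⁻¹ at 100 MHz value from the text as its reference point; the function name is hypothetical.

```python
import math

def coax_attenuation_db_per_km(f_hz, ref_db_per_km=20.0, ref_f_hz=100e6):
    """Scale a reference attenuation as the square root of frequency (skin effect only)."""
    return ref_db_per_km * math.sqrt(f_hz / ref_f_hz)

if __name__ == "__main__":
    for f in (25e6, 100e6, 400e6, 1e9):
        print(f"{f / 1e6:7.0f} MHz: {coax_attenuation_db_per_km(f):5.1f} dB/km")
```

On this simple scaling, quadrupling the carrier frequency doubles the loss in decibels per kilometre, whereas the attenuation of a fiber does not depend on the modulation frequency in this way.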

A schematic diagram of an optical transmission system is shown in Fig. 9.10. The emitter
is usually an LED or semiconductor laser, whilst the detector may be a p-i-n or avalanche
photodiode. Also included on this diagram are what are known as repeater units which are
present to counter the effects of fiber transmission losses and dispersion. In them the weak,
and possibly broadened, optical pulses (we are assuming digital transmission) are detected
and converted to (weak) electrical pulses. These pulses are then reshaped, amplified and
retimed, to reproduce, as far as is possible, the original noise-free pulse train. Finally a new
train of optical pulses is generated for transmission down the next section of fiber. With an
analog signal it is not possible to restore a distorted signal; any signal distortion or noise
present is, of necessity, passed on along the next section. A separate power supply line must
be provided for the repeater units and, as might be imagined, their presence adds greatly to
the cost of a link. As we shall see in section 9.3.5, the need for repeaters has been almost
wholly obviated by the development of the optical fiber amplifier.

FIG. 9.10 Schematic diagram of the main components of a fiber optical communication system involving
a repeater.

The main advantages of optical fiber links include relatively low signal attenuations, high
bandwidths (up to several gigahertz or more), small physical size and weight, the elimination
of ground loop problems and immunity from electrical interference. This last advantage
is especially useful in electrically noisy environments such as densely populated urban
areas and power stations. There is, of course, no spark hazard, an additional advantage in
explosive environments such as chemical plants. Fiber cables may also survive better than
copper cables in certain corrosive environments, for example in sea water.

The availability of optical fibers has revolutionized telephone 'trunk' links, that is links
capable of carrying a large number of simultaneous telephone conversations between telephone
exchange buildings. These may be from a few kilometres up to several hundred kilometres apart.
We saw in section 9.1.2 that a single channel telephone link requires a bandwidth of 4 kHz
or, for a digital signal, 64 kbps. Telephone networks require links capable of carrying many
single channels simultaneously. Digital transmission is able to cope with this easily since
the separate bit streams required may be interleaved. In both the United States and Europe,
standard rates have been agreed upon for various 'levels' of transmission and these are shown
in Table 9.1. Fiber optical links offer obvious economic advantages for medium to long haul
links (10 km and upwards) with large capacities (100 Mbps and upwards).

TABLE 9.1 Digital rates used in telecommunications in Europe and the United States

There are numerous other applications including, for example, undersea links, video trans-
mission, computer links and, in the military sphere, missile guidance. The concept of an
optical local area network (LAN) will be discussed further in section 9.3.7. Several pilot 
experiments have been run to examine the feasibility of providing the whole of a commu- 
nity's communication/information needs (such as telephone, TV, radio, etc.) using fiber 
optics. It is obviously impossible in a text of this size to cover these and other areas of inter- 
est adequately, and the reader is referred to ref. 9.5 for further information. 

The individual components involved in a fiber optical communication link (emitters, fibers 
and detectors) have all been covered in general terms in previous chapters, and we now review 
the possibilities open to us which meet the requirements of a communication system. We 
consider first the choice of operating wavelength. 

9.3.1 Operating wavelength 

The two crucial characteristics of optical fibers that depend on wavelength are attenuation
and material dispersion. A typical attenuation versus wavelength curve for a silica-based fiber
was shown in Fig. 8.32. Material dispersion, that is the variation in group velocity with wavelength,
was covered in section 8.3.6. There we saw (eq. 8.33) that the pulse spread $\Delta\tau$ over
a fiber of length $L$ could be written as

$$
\Delta\tau = \frac{L}{c}\,\lambda_0^2\left|\frac{d^2 n}{d\lambda_0^2}\right|\frac{\Delta\lambda_0}{\lambda_0}
$$

The pulse spread per unit length per unit wavelength interval, $\Delta\tau/(L\,\Delta\lambda_0)$, for silica is shown
in Fig. 9.11. The first-generation optical fiber transmission systems used emitters based on
GaAs or GaAlAs which operate in the wavelength range 0.82 µm to 0.9 µm. This is by no
means an ideal wavelength region: at 0.85 µm silica fibers have minimum attenuations of
about 2 dB km⁻¹ whilst their material dispersion is some 80 ps nm⁻¹ km⁻¹. Thus, light from a
typical LED with a linewidth of 50 nm would suffer a dispersion of 4 ns km⁻¹. Over a
10 km fiber length, the system bandwidth would then be restricted to some 25 MHz by material
dispersion alone. Lasers, of course, have a considerable advantage here: by virtue of their
narrow linewidths, they enable much higher bandwidths to be achieved (see Example 8.10).
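
The arithmetic in the preceding paragraph can be laid out as a short calculation. This is a rough sketch: the bandwidth is taken simply as the reciprocal of the pulse spread, an assumption that reproduces the 25 MHz quoted above, and the 3 nm laser linewidth used for comparison is an illustrative value. The function names are hypothetical.

```python
def pulse_spread_s(dispersion_ps_per_nm_km, linewidth_nm, length_km):
    """Material-dispersion pulse spread in seconds."""
    return dispersion_ps_per_nm_km * linewidth_nm * length_km * 1e-12

def bandwidth_hz(spread_s):
    """Crude bandwidth estimate, taken as 1 / (pulse spread); an assumption."""
    return 1.0 / spread_s

if __name__ == "__main__":
    led = pulse_spread_s(80, 50, 10)      # LED at 0.85 um over 10 km
    laser = pulse_spread_s(80, 3, 10)     # illustrative 3 nm laser linewidth
    print(f"LED:   {led * 1e9:.1f} ns, {bandwidth_hz(led) / 1e6:.0f} MHz")
    print(f"laser: {laser * 1e9:.1f} ns, {bandwidth_hz(laser) / 1e6:.0f} MHz")
```

Since the spread is proportional to the source linewidth, any reduction in linewidth raises the dispersion-limited bandwidth by the same factor.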

Fiber attenuation may be considerably reduced by working at longer wavelengths, the
minimum (about 0.15 dB km⁻¹) occurring at a wavelength of about 1.55 µm. Material dispersion
also decreases at longer wavelengths, becoming very small indeed at around 1.3 µm,
where values of a few picoseconds per nanometre per kilometre can be achieved. All long
distance, high capacity telecommunication links now operate at wavelengths of 1.3 µm or
1.55 µm, and suitable p-i-n and APD detectors for these wavelengths based on InGaAs
are now readily available.

Plastic fibers have extremely high attenuations outside the visible region (Fig. 8.41) and 
are usually used in short distance links in conjunction with red-emitting LEDs. 

9.3.2 Emitter design 

The main requirements for an emitter in an optical communication system are that it must
be able to couple a useful amount of power into the fiber and also be capable of operating
with the required frequency modulation bandwidth. The relatively small diameters of optical
fiber cores (from several hundred micrometres down to a few micrometres) imply correspondingly
small source sizes for efficient coupling of radiation into the fiber. Consequently,
the most commonly used sources are LEDs and semiconductor lasers which were discussed
in Chapters 4 and 5 respectively.

FIG. 9.11 Material dispersion for pure silica as derived from the data shown in Fig. 8.24, and expressed

Compared with semiconductor lasers, LED sources are easy to drive, have long lifetimes
and are inexpensive. Their principal disadvantage, apart from their greater linewidth, is that
they are much less efficient at launching power into fibers than are lasers. This is mainly
because of their larger emitting area, greater beam divergence and incoherent light output.
We now calculate the maximum power that can be coupled into a multimode step index fiber
from a source in contact with it. We assume that the source area $A_s$ is less than or equal to
the fiber core area $A_f$. Figure 9.12 illustrates the basic geometry. For each point on the emitting
surface, only light emitted up to an angle $\alpha_{\max}$, where $\alpha_{\max} = \sin^{-1}(\mathrm{NA}/n_0)$, see eq. (8.21),
with the normal to the surface will propagate down the fiber. If the source brightness as a
function of angle is written $B(\alpha)$, the total energy $\Phi_f$ coupled into the fiber is given by

$$
\Phi_f = A_s\int_\Omega B(\alpha)\,d\Omega \tag{9.6}
$$

Here $d\Omega$ is the solid angle subtended by rays between the angles $\alpha$ and $\alpha + d\alpha$, and the integral
is carried out for all rays that remain trapped in the fiber. The relation between $d\Omega$ and
$d\alpha$ can readily be derived with the aid of Fig. 9.13 and is

$$
d\Omega = 2\pi\sin\alpha\,d\alpha
$$

Hence, inserting this value for $d\Omega$ into eq. (9.6) yields

$$
\Phi_f = 2\pi A_s\int_0^{\alpha_{\max}} B(\alpha)\sin\alpha\,d\alpha
$$

LED sources are usually approximately Lambertian so that we may put $B(\alpha) = B(0)\cos\alpha$
(see Problem 4.6). Equation (9.6) then becomes

$$
\Phi_f = 2\pi A_s B(0)\int_0^{\alpha_{\max}}\cos\alpha\,\sin\alpha\,d\alpha = \pi A_s B(0)\sin^2\alpha_{\max}
$$

or

$$
\Phi_f = \pi A_s B(0)\,\frac{(\mathrm{NA})^2}{n_0^2}
$$

FIG. 9.12 Illustration of the coupling of light from a flat emitting surface into a fiber optical waveguide.
Rays making an angle less than $\alpha_{\max}$ with the normal to the fiber end will become trapped in the fiber core.
The gap between the emitter and the fiber has been deliberately enlarged to show the angles involved.

FIG. 9.13 Relation between solid angle $d\Omega$ and $d\alpha$ may be derived by noting that
$d\Omega = (\mathrm{BD})\cdot 2\pi\cdot(\mathrm{DE})/(\mathrm{AB})^2$. Since $\mathrm{BD} = \mathrm{AB}\,d\alpha$ and $\mathrm{DE}/\mathrm{AB} = \sin\alpha$, we have
$d\Omega = 2\pi\sin\alpha\,d\alpha$.

We know from eq. (4.21a) that the total output of the LED is $\pi A_s B(0)$ and hence the coupling
efficiency $\eta_c$ may be written

$$
\eta_c = \frac{\text{energy coupled into fiber}}{\text{energy emitted by LED}}
       = \frac{\pi A_s B(0)(\mathrm{NA})^2}{\pi A_s B(0)\,n_0^2}
$$

that is

$$
\eta_c = \frac{(\mathrm{NA})^2}{n_0^2} \tag{9.7}
$$

A typical value for the numerical aperture of a multimode fiber would be 0.3. Assuming an
air interface (i.e. $n_0 = 1$), eq. (9.7) then shows that only some 9% at most of the total radiation
emitted by the surface in contact with the core area will enter the fiber. (Fresnel losses
have been ignored in this derivation since they amount to only a few per cent of the energy
coupled into the fiber.) If $A_s$ is larger than $A_f$, the coupling efficiency will be further reduced.
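
The result of eq. (9.7) can be confirmed by carrying out the angular integral of eq. (9.6) numerically for a Lambertian source. The following is a sketch under the same assumptions as the derivation (source no larger than the core, Fresnel losses ignored); $B(0)$ and $A_s$ are set to unity since they cancel, and the function names are hypothetical.

```python
import math

def coupling_efficiency(na, n0=1.0):
    """Eq. (9.7): fraction of Lambertian LED output accepted by a step index fiber."""
    return (na / n0) ** 2

def coupling_efficiency_numeric(na, n0=1.0, steps=100_000):
    """Integrate B(0) cos(a) * 2 pi sin(a) da up to alpha_max and divide by pi B(0)."""
    alpha_max = math.asin(na / n0)
    d_alpha = alpha_max / steps
    coupled = 0.0
    for k in range(steps):
        a = (k + 0.5) * d_alpha                      # midpoint rule
        coupled += 2 * math.pi * math.cos(a) * math.sin(a) * d_alpha
    return coupled / math.pi                         # total LED output is pi * A_s * B(0)

if __name__ == "__main__":
    print(coupling_efficiency(0.3))                  # about 0.09, i.e. some 9%
    print(coupling_efficiency_numeric(0.3))
```

Because the efficiency varies as the square of the numerical aperture, halving the NA reduces the coupled power by a factor of four.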

For a graded index fiber the calculation is more complicated. We note, however (section
8.3), that a parabolically graded fiber (i.e. $\alpha = 2$) with the same diameter as a step index
fiber is only able to support about half the number of modes, and hence the coupled energy
is expected to be about a factor of two smaller.

To maximize the coupling efficiency into multimode fibers, we therefore require a fiber
with as large an NA as possible and a source with an area no larger than the fiber core area.
Increasing the NA, however, implies that the factor $(n_1 - n_2)$ increases, thus incurring the
penalty of a smaller signal bandwidth because of increasing mode dispersion (see eq. 8.24).
In addition reducing the emitting area of an LED whilst maintaining total output tends to
reduce the operating lifetime.

The two main types of LED most often used in fiber optical systems are the surface-etched
well emitter ('Burrus' type) and the edge emitter, both of which are illustrated in Fig. 9.14.
In the former, a well is etched into the top of a planar LED structure to enable the fiber end
to be as close as possible to the light-emitting region (i.e. to the p-n junction). If the emitting
area is less than that of the fiber core, then some form of optical coupling between the
source and the fiber may be advantageous. For example, a spherical lens on top of the emitting
surface will magnify the effective surface area, but demagnify the solid angular distribution
of the radiation, so that the resulting radiation pattern may be a better match to the
fiber acceptance pattern (Fig. 9.15). Other types of lenses that can be used to enhance source-to-fiber
coupling efficiency are the sphere and GRIN types discussed in Chapter 1 (see section
1.3.3).



FIG. 9.14 Two types of emitter designed for more efficient coupling into optical fibers: (a) the etched well
or 'Burrus' type and (b) an edge emitter. In (a), the active light-emitting area is restricted to a small region
just below the end of the fiber by the use of an SiO₂ mask. The fiber is held in position by the use of a
transparent epoxy resin which also helps to reduce Fresnel losses. In the edge emitter, the radiation is
confined to a narrow light-guiding layer with a structure very similar to that of the double heterostructure
laser.

FIG. 9.15 One way devised to increase the coupling efficiency of emitters to fibers is to incorporate some
form of lens system; here, the effect of a spherical lens is illustrated.



The construction of the edge-emitting LED is very similar to that of the semiconductor
laser (see section 5.10.2), the radiation being confined to a narrow channel. On emerging,
the beam divergences tend to be narrower than those of the surface types with typical half-power
divergences of 50° and 30° perpendicular and parallel to the junction plane respectively
(a Lambertian source has a half-power beam divergence of 60°). To couple this
distribution into a fiber efficiently, an anamorphic optical system is sometimes used, such
as that shown in Fig. 9.16.

FIG. 9.16 Strongly asymmetrical radiation patterns from edge emitters can more efficiently be coupled
into a fiber using an anamorphic lens (i.e. one with differing curvatures in two orthogonal directions).

Total optical powers from edge emitters are typically several times smaller than from
surface emitters, but the narrower beam divergence can give rise to more coupled power.
Edge emitters are usually preferred for use with small NA fibers (i.e. NA < 0.4), whereas
surface emitters, because of their greater total power output, are better for large NA fibers.

As far as output characteristics are concerned, LEDs have an almost linear relationship
between drive current and light output. This makes the LED more suitable for amplitude
than pulse modulation, since for the latter the drive current must be switched from high to
low values to obtain a wide ratio between 'on' and 'off' outputs (say from 300 mA to below
10 mA). The factors that ultimately affect the modulation bandwidths attainable are discussed
in section 4.6.5. At low current levels, the limiting factor is usually the junction capacitance,
whilst at high current levels it is the lifetime of carriers injected into the recombination region.
It is shown in Appendix 3 that when the response is lifetime limited, we may write

$$
R(f) = \frac{R(0)}{(1 + 4\pi^2 f^2\tau^2)^{1/2}} \tag{9.8}
$$

where $R(f)$ is the relative response at frequency $f$ and $\tau$ is the minority carrier lifetime. In
GaAs a typical value of $\tau$ would be $10^{-9}$ s, giving a half-power bandwidth of some 300 MHz.
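
The bandwidth figure quoted for eq. (9.8) can be checked with a few lines of code. The sketch assumes that 'half-power' means the frequency at which the relative optical response falls to one half, which for a lifetime of 1 ns gives a value close to the 300 MHz in the text; if the electrical power were halved instead, the response would be $1/\sqrt{2}$ and the frequency correspondingly lower. The function names are hypothetical.

```python
import math

def relative_response(f_hz, tau_s):
    """Eq. (9.8) normalized to the low frequency response, R(f)/R(0)."""
    return 1.0 / math.sqrt(1.0 + (2 * math.pi * f_hz * tau_s) ** 2)

def half_response_frequency(tau_s):
    """Frequency at which R(f)/R(0) = 1/2, i.e. 2*pi*f*tau = sqrt(3)."""
    return math.sqrt(3) / (2 * math.pi * tau_s)

if __name__ == "__main__":
    tau = 1e-9                                    # minority carrier lifetime in GaAs
    f_half = half_response_frequency(tau)
    print(f"{f_half / 1e6:.0f} MHz")              # roughly 276 MHz
    print(relative_response(f_half, tau))         # 0.5
```

The bandwidth is inversely proportional to the carrier lifetime, so a shorter lifetime gives a faster LED.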
We turn now to a consideration of laser sources. In comparison with LEDs, they exhibit
much higher output power, narrower beam divergence and a smaller emitting area. All of
these factors enable lasers to couple significantly more power into fibers than do LEDs (see
Problem 9.5). However, an even more significant factor is that lasers emit coherent radiation
and it is this that enables them to couple significant power into single mode fibers. The
derivation of eq. (9.7) used simple ray theory which we know is not valid in single mode
fibers. To launch light into a particular mode in a fiber with 100% efficiency it is necessary
in fact to reproduce on the face of the fiber the exact field amplitude distribution corresponding
to that particular mode. In the event that this is not possible, then the actual power
coupling efficiency into the fiber, $\eta_c$, is given by evaluating the so-called overlap integral
so that the coupling efficiency becomes

$$
\eta_c = \frac{\left[\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\mathcal{E}_m(x, y)\,\mathcal{E}_s(x, y)\,dx\,dy\right]^2}
{\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\mathcal{E}_m(x, y)\,\mathcal{E}_m(x, y)\,dx\,dy
 \int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\mathcal{E}_s(x, y)\,\mathcal{E}_s(x, y)\,dx\,dy} \tag{9.9}
$$

where $\mathcal{E}_m(x, y)$ is the required mode field inside the fiber, and $\mathcal{E}_s(x, y)$ the actual field distribution
over the end of the fiber.

As we saw in section 8.3.4 a reasonable approximation to the mode field distribution for
a single mode fiber is given by a Gaussian function (eq. 8.29) and we may write the field
as $\mathcal{E}_m(r) = \mathcal{E}_m(0)\exp[-(r/\omega_0)^2]$. Similarly the beam from a single mode He-Ne laser, for
example, also has a Gaussian field profile. If such a beam is focused onto the end of the fiber
the field profile will remain Gaussian, and we may write for the focused beam

$$
\mathcal{E}_s(r) = \mathcal{E}_s(0)\exp[-(r/\omega_s)^2]
$$

Equation (9.9) may now be used to evaluate the coupling efficiency of the beam into the
fiber. The integrals may be carried out using radial coordinates (see Problem 9.6), the result
being

$$
\eta_c = \frac{4\omega_0^2\omega_s^2}{(\omega_0^2 + \omega_s^2)^2} \tag{9.10}
$$

For example, if the focused spot on the end of a single mode fiber has a mode field diameter
which differs by a factor of two from that of the mode field in the fiber then from eq.
(9.10) the coupling efficiency will be $(4/5)^2$ or 0.64. It is reasonably easy to obtain coupling
efficiencies between 0.5 and 0.6 using simple butt jointing techniques in conjunction with
single mode fibers and semiconductor laser diodes.
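
Equation (9.10) can be checked against a direct numerical evaluation of the overlap integral of eq. (9.9) in radial coordinates. The sketch below assumes perfectly aligned, real Gaussian fields with no tilt or lateral offset; the peak amplitudes cancel and are set to unity. The function names and the integration cut-off are choices made here for illustration.

```python
import math

def gaussian_coupling(w0, ws):
    """Eq. (9.10): coupling efficiency between two aligned Gaussian fields."""
    return 4 * w0 ** 2 * ws ** 2 / (w0 ** 2 + ws ** 2) ** 2

def gaussian_coupling_numeric(w0, ws, n=20_000):
    """Evaluate eq. (9.9) by summing over thin rings of area 2 pi r dr."""
    r_max = 8 * max(w0, ws)            # fields are negligible beyond this radius
    dr = r_max / n
    cross = fiber = beam = 0.0
    for k in range(n):
        r = (k + 0.5) * dr             # midpoint of the ring
        e_m = math.exp(-(r / w0) ** 2) # fiber mode field
        e_s = math.exp(-(r / ws) ** 2) # focused source field
        ring = 2 * math.pi * r * dr
        cross += e_m * e_s * ring
        fiber += e_m * e_m * ring
        beam += e_s * e_s * ring
    return cross ** 2 / (fiber * beam)

if __name__ == "__main__":
    print(gaussian_coupling(1.0, 2.0))           # (4/5)^2 = 0.64
    print(gaussian_coupling_numeric(1.0, 2.0))
```

The expression is symmetric in the two spot sizes, so a focused spot that is twice too large couples exactly as well as one that is twice too small.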

Lasers, of course, have very different output characteristics from LEDs. From Fig. 5.25
we see that very little radiation is emitted until the current reaches a threshold value, after
which the output rises very sharply. Such a characteristic is well suited to generating digital
signals, since the drive current needs only a small swing to provide a high 'on' to 'off'
contrast. Analog signals can also be transmitted, since the characteristic above threshold is
reasonably linear. (Sometimes, however, 'kinks' can develop as discussed in section 5.10.2.3.)
One problem which severely affects both types of signal is that the threshold current is
temperature sensitive and can show long term drift with age. Good temperature stabilization is
therefore required together with some form of feedback mechanism to counter any change
in the characteristic. The most common technique with digital signals is to maintain a
constant power output. For this purpose, a photodetector may be mounted next to the rear facet
of the laser. The drive current may then be adjusted until the photodetector (which need not
have a very fast response) indicates some predetermined average power level.

An important requirement for high bandwidth systems is to have as small a source linewidth
as possible so that material dispersion is reduced (eq. 8.34 and Example 8.10). A
semiconductor laser with a Fabry-Perot-type cavity usually has a (multimode) linewidth of about 3 nm.
This linewidth can be reduced by ensuring that only one longitudinal mode is present, for
example by using the distributed feedback structure (section 6.2). Linewidths can then
approach 10 MHz or so (corresponding to $10^{-4}$ nm). Further reductions (down to a few tens
of kilohertz) can be obtained by using some form of external cavity (section 6.2).

Another factor that needs to be taken into account is the fact that the output pulse from a
laser is 'chirped', that is, the emission frequency changes during the pulse. This arises from
a number of factors: for example, within the semiconductor laser material the operating
wavelength depends on the product of cavity length multiplied by the refractive index. Both these
quantities are affected by the changes in temperature that will take place during a current
pulse. In addition the refractive index decreases as the carrier concentration increases. Thus
the effective spread of wavelengths emitted by the laser is increased. This in turn increases
the material dispersion and reduces the maximum bit rate possible. One solution to this is
to operate the laser CW and modulate the radiation externally (possible devices will be
described in section 9.4.2).
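
The correspondence between a linewidth quoted in frequency and one quoted in wavelength follows from $\Delta\lambda \approx \lambda^2 \Delta\nu / c$, valid for linewidths small compared with the optical frequency. The sketch below applies it to a 10 MHz linewidth; the operating wavelength of 1.55 µm is an assumption made for illustration, as are the function and constant names.

```python
C_LIGHT = 2.998e8  # speed of light in vacuum, m/s


def linewidth_hz_to_nm(delta_nu_hz: float, wavelength_m: float) -> float:
    """Convert a small frequency linewidth into the equivalent wavelength spread in nm."""
    delta_lambda_m = wavelength_m**2 * delta_nu_hz / C_LIGHT
    return delta_lambda_m * 1e9  # metres to nanometres


if __name__ == "__main__":
    # 10 MHz linewidth at an assumed wavelength of 1.55 um.
    print(linewidth_hz_to_nm(10e6, 1.55e-6))
```

The result is of the order of $10^{-4}$ nm, in line with the figure quoted above.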

In conclusion it is evident that lasers have considerable advantages over LEDs from the 
point of view of coupling the maximum amount of power into fibers (especially single mode 
^bers) and giving minimal material dispersion. The disadvantages are that they are more 
expensive and require somewhat elaborate temperature control and output stabilization. 

9.3.3 Detector design

Most detectors have large sensitive areas compared with fiber core areas and, Fresnel losses 
apart, can easily collect most of the radiation being carried by the fiber. On the other hand, 
too big a detector area is often a positive disadvantage since this implies an unnecessarily 
high dark current noise and a high detector capacitance. Thermal detectors are, in general, 
much too slow and insensitive and the almost universal choice has been for some kind of 
junction device. Of these, the main choice has been between p-i-n and avalanche photo- 
diodes. The latter provide a substantial amount of gain which is useful in increasing the 
system sensitivity but only when this is limited by Johnson noise in the load resistor or noise 
in the amplifier following the detector. To illustrate this point we now carry out a simpli- 
fied noise analysis for both a p-i-n and an avalanche (APD) detector. 



9.3.3.1 Noise analysis for a p-i-n detector

The equivalent circuit of the detector and amplifier as far as noise analysis is concerned is
shown in Fig. 9.17. If optical power $P$ at a wavelength $\lambda_0$ falls on the detector then, if the
detector quantum efficiency is $\eta$, the resulting signal current $i_\lambda$ may be written (see eq. 7.25)

$$i_\lambda = \frac{\eta P e \lambda_0}{hc} \qquad (9.11)$$

Also flowing through the load resistor is the diode reverse bias saturation, or dark, current
$i_D$. The total current $i_\lambda + i_D$ will give rise to shot noise in the current with an r.m.s.
magnitude given by eq. (7.12), that is

$$\Delta(i_\lambda + i_D)_{\rm shot} = [2(i_\lambda + i_D)e\Delta f]^{1/2} \qquad (9.12)$$

Johnson noise is also present in the load resistor $R_L$; from eq. (7.14) the r.m.s. Johnson noise
voltage is

$$\Delta V_J = (4kTR_L\Delta f)^{1/2}$$

The equivalent r.m.s. noise current $\Delta i_J$ is then

$$\Delta i_J = \frac{(4kTR_L\Delta f)^{1/2}}{R_L}$$

FIG. 9.17 Assumed equivalent circuit for noise analysis in a p-i-n diode. A current source $i_\lambda$ is shunted by
the diode junction capacitance $C_j$ and the load resistor $R_L$.

Inevitably, the signal levels are usually such that amplification is required and any noise
generated in the amplifier may also be important. It is useful to deal with this by including it in
the Johnson noise by introducing a 'noise factor' $F_n$ (where $F_n > 1$) so that

$$\Delta i_J = \frac{(4kTF_nR_L\Delta f)^{1/2}}{R_L} \qquad (9.13)$$

Because the two noise terms are independent, the total noise power present within the
resistor is proportional to the sum of the squares of the shot noise current and Johnson noise
current. The signal power is just proportional to the square of $i_\lambda$, so that we may write the
signal-to-noise power ratio as

$$S/N = \frac{i_\lambda^2}{[\Delta(i_\lambda + i_D)_{\rm shot}]^2 + (\Delta i_J)^2}$$

or

$$S/N = \frac{i_\lambda^2}{2(i_\lambda + i_D)e\Delta f + 4kTF_n\Delta f/R_L} \qquad (9.14)$$

It is evident that the importance of the Johnson noise term may be reduced, and the S/N
value increased, by increasing the value of the load resistor. However, we must remember
that the detector capacitance is in parallel with the load resistor and, unless some form of
frequency equalization is introduced, the bandwidth will be limited to $(2\pi R_L C_j)^{-1}$ (see eq.
7.35). Thus with a bandwidth of $\Delta f$, the maximum value of $R_L$ that may be used is given by

$$(R_L)_{\rm max} = (2\pi C_j \Delta f)^{-1} \qquad (9.15)$$

Thus the maximum S/N value is given by

$$(S/N)_{\rm max} = \frac{i_\lambda^2}{2(i_\lambda + i_D)e\Delta f + 8\pi kTF_n\Delta f^2 C_j} \qquad (9.16)$$



EXAMPLE 9.1 S/N ratio in a p-i-n photodiode

We consider a p-i-n photodiode which has a quantum efficiency of 0.6 at a wavelength
of 1.3 µm and a reverse bias leakage current of 3 nA. It is used in a simple
photoconductive bias circuit with a load resistor of 50 Ω, the system bandwidth is 500 MHz
and 10 µW of optical power (at 1.3 µm) falls onto the detector. The photogenerated
current ($i_\lambda$) is given by

$$i_\lambda = \frac{\eta P e \lambda_0}{hc} = \frac{0.6 \times 10 \times 10^{-6} \times 1.6 \times 10^{-19} \times 1.3 \times 10^{-6}}{6.6 \times 10^{-34} \times 3 \times 10^{8}} = 6.29\ \mu\text{A}$$

The total shot noise current is given by eq. (9.12) as

$$\Delta(i_\lambda + i_D)_{\rm shot} = [2(6.29 \times 10^{-6} + 3 \times 10^{-9}) \times 1.6 \times 10^{-19} \times 500 \times 10^{6}]^{1/2} = 31.8\ \text{nA}$$

whilst the Johnson noise current is given by eq. (9.13) as

$$\Delta i_J = \frac{(4 \times 1.38 \times 10^{-23} \times 300 \times 50 \times 500 \times 10^{6})^{1/2}}{50} = 407\ \text{nA}$$

We see that in this example the Johnson noise term is significantly larger than the shot
noise term. The resulting S/N value (assuming an excess noise factor of unity) is then
given by

$$S/N = \frac{(6.29 \times 10^{-6})^2}{(31.8 \times 10^{-9})^2 + (407 \times 10^{-9})^2} = 237 \ \text{or}\ 23.7\ \text{dB}$$

However, depending on the value of the capacitance of the device this might not be
the highest possible S/N value. Suppose the capacitance is 1 pF: the optimum load
resistance is then given by eq. (9.15) as

$$(R_L)_{\rm max} = (2\pi \times 1 \times 10^{-12} \times 500 \times 10^{6})^{-1} = 318\ \Omega$$

With this value of $R_L$ the Johnson noise current is reduced to 161 nA, and the
signal-to-noise ratio increases to 1463 (or 31.7 dB).
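
The arithmetic of this example can be reproduced with a short script. This is only a sketch of eqs (9.11) to (9.15) using rounded physical constants; all function and variable names are hypothetical, and small differences from the quoted figures arise from rounding.

```python
import math

E_CHARGE = 1.6e-19   # electron charge, C
H_PLANCK = 6.626e-34  # Planck constant, J s
C_LIGHT = 3.0e8      # speed of light, m/s
K_BOLTZ = 1.38e-23   # Boltzmann constant, J/K


def signal_current(eta, power_w, wavelength_m):
    """Photogenerated current, eq. (9.11)."""
    return eta * power_w * E_CHARGE * wavelength_m / (H_PLANCK * C_LIGHT)


def shot_noise_current(i_signal, i_dark, bandwidth_hz):
    """R.m.s. shot noise current, eq. (9.12)."""
    return math.sqrt(2 * (i_signal + i_dark) * E_CHARGE * bandwidth_hz)


def johnson_noise_current(temp_k, r_load, bandwidth_hz, noise_factor=1.0):
    """R.m.s. Johnson noise current including the amplifier noise factor, eq. (9.13)."""
    return math.sqrt(4 * K_BOLTZ * temp_k * noise_factor * bandwidth_hz / r_load)


def snr(i_signal, i_dark, temp_k, r_load, bandwidth_hz, noise_factor=1.0):
    """Signal-to-noise power ratio, eq. (9.14)."""
    shot = shot_noise_current(i_signal, i_dark, bandwidth_hz)
    johnson = johnson_noise_current(temp_k, r_load, bandwidth_hz, noise_factor)
    return i_signal**2 / (shot**2 + johnson**2)


def max_load_resistance(c_junction, bandwidth_hz):
    """Largest load resistor compatible with the bandwidth, eq. (9.15)."""
    return 1.0 / (2 * math.pi * c_junction * bandwidth_hz)


if __name__ == "__main__":
    i_sig = signal_current(0.6, 10e-6, 1.3e-6)
    for r_load in (50.0, max_load_resistance(1e-12, 500e6)):
        ratio = snr(i_sig, 3e-9, 300.0, r_load, 500e6)
        print(f"R_L = {r_load:.0f} ohm: S/N = {ratio:.0f} ({10 * math.log10(ratio):.1f} dB)")
```

Running the script for the two load resistors shows directly how raising $R_L$ to the bandwidth-limited maximum lowers the Johnson noise term while leaving the shot noise unchanged.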



Example 9.2 gives typical calculations associated with eq. (9.16). As shown in this example
it is often true that the Johnson noise term is significantly larger than the shot noise. If we
neglect the shot noise term in eq. (9.16) and also use eq. (9.11) to substitute for $i_\lambda$ then we
obtain

$$(S/N)_{\rm Johnson} = \frac{(\eta P e \lambda_0 / hc)^2}{8\pi kTF_n\Delta f^2 C_j} \qquad (9.17)$$

If, however, we wish to consider the absolute limitations inherent in direct detection then
we may assume that we use a sufficiently large value of $R_L$ that the shot term dominates the
denominator in eq. (9.14) (and that we may somehow compensate for the bandwidth
problems). Thus in the shot-noise-limited situation and assuming that the dark current may also
be neglected

$$(S/N)_{\rm shot} = \frac{i_\lambda}{2e\Delta f} = \frac{\eta P \lambda_0}{2hc\Delta f} \qquad (9.18)$$



9.3.3.2 Noise analysis for an avalanche photodiode 

If the gain of the APD is $M$, the signal current will be $Mi_\lambda$. We might then also expect that
the shot noise terms would be given by replacing $(i_\lambda + i_D)$ in eq. (9.12) by $M(i_\lambda + i_D)$.
However, we must remember that the avalanche process itself introduces excess noise, which
is allowed for by introducing an excess noise factor $F(M)$ (see eq. 7.38). Consequently, we
must write

$$\Delta(i_\lambda + i_D)_{\rm shot} = M[2(i_\lambda + i_D)e\Delta f F(M)]^{1/2}$$

The S/N ratio then becomes

$$S/N = \frac{M^2 i_\lambda^2}{M^2 F(M) 2e\Delta f (i_\lambda + i_D) + 8\pi kTF_n\Delta f^2 C_j} \qquad (9.19)$$

If the factor $F(M)$ were unity, the S/N ratio would increase with increasing $M$ and would
attain a maximum asymptotic value equal to that given by eq. (9.18) for shot noise alone.
However, because the factor $F(M)$ increases with increasing $M$, the optimum S/N value is
somewhat smaller than this. Figure 9.18 shows how the S/N value varies with APD gain
for two values of the parameter $r$ which is involved in the function $F(M)$ ($r$ is the ratio of
electron-to-hole ionization probabilities, see eq. 7.38a). Problem 9.7 considers the
determination of the optimum APD gain in more detail.

It should be noted that the APD only improves the S/N ratio if the dominant noise term is
other than shot noise. When only shot noise is present eq. (9.19) reduces to $S/N = i_\lambda/[F(M)2e\Delta f]$
and the S/N ratio is then a factor $F(M)$ smaller than in a p-i-n photodiode.
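
The trade-off in eq. (9.19) can be explored numerically. Dividing numerator and denominator by $M^2 i_\lambda^2$ gives $S/N = [F(M)/(S/N)_{\rm shot} + 1/(M^2 (S/N)_{\rm Johnson})]^{-1}$, where the two ratios are those that shot noise alone and Johnson noise alone would give at unit gain. The sketch below uses that form. The excess noise function is passed in by the caller; the linear model $F(M) = M$ used here is only an illustrative stand-in and is not taken from eq. (7.38a), and all names are hypothetical.

```python
def unity_excess_noise(gain: float) -> float:
    """Idealised case F(M) = 1, i.e. no excess avalanche noise."""
    return 1.0


def linear_excess_noise(gain: float) -> float:
    """Illustrative stand-in F(M) = M; an assumption, NOT eq. (7.38a)."""
    return gain


def apd_snr(gain, snr_shot, snr_johnson, excess_noise):
    """Eq. (9.19) expressed through the two unit-gain S/N ratios."""
    return 1.0 / (excess_noise(gain) / snr_shot + 1.0 / (gain**2 * snr_johnson))


def optimum_gain(snr_shot, snr_johnson, excess_noise, gains):
    """Return the gain from `gains` that maximises the S/N ratio."""
    return max(gains, key=lambda m: apd_snr(m, snr_shot, snr_johnson, excess_noise))


if __name__ == "__main__":
    # Unit-gain values quoted for Fig. 9.18: Johnson S/N of 10, shot S/N of 300.
    gains = [1 + 0.01 * k for k in range(2000)]
    best = optimum_gain(300.0, 10.0, linear_excess_noise, gains)
    print(best, apd_snr(best, 300.0, 10.0, linear_excess_noise))
```

With $F(M) = 1$ the ratio rises monotonically towards the shot noise value, whereas any $F(M)$ that grows with $M$ produces a maximum at a finite gain, which is the behaviour described above.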



FIG. 9.18 The signal-to-noise ratio from an avalanche photodiode as a function of gain as given by eq.
(9.19). Two curves are shown each corresponding to differing values of the electron-hole ionization
probability ratio $r$. In the absence of any gain the detector would exhibit a Johnson noise S/N ratio of 10 and a
shot noise S/N ratio of 300. We see that as $r$ increases the optimum S/N ratio gets closer to that arising
from shot noise alone.

9.3.3.3 Fundamental limitations on signal size

Although in a practical situation Johnson noise may often be the limiting factor, it is of
interest to see what the theoretical fundamental limitations on the pulse size actually are. In
a digitally coded optical signal the smallest conceivable size corresponds to the arrival of a
single photon. On this scale, however, the statistical nature of the photon emission and
absorption processes becomes evident. For example, suppose the average power transmitted
corresponds to 20 photons per pulse; then it is possible for pulses to arrive at the detector
containing perhaps 18 or 24 or indeed any other number of photons including zero. It can
be shown (ref. 9.6) that the process is governed by Poisson statistics and that the
probability $p(n, n_m)$ of detecting $n$ photons per unit time interval when the mean arrival rate is $n_m$ is
given by

$$p(n, n_m) = \frac{n_m^n}{n!}\exp(-n_m) \qquad (9.20)$$

For example, with a mean arrival rate of 20 photons per pulse, the probability that a given
pulse contains no photons at all is given by

$$p(0, 20) = \frac{(20)^0}{0!}\exp(-20) \approx 2 \times 10^{-9}$$

Consequently if we have a signal consisting of equal numbers of 'ones' and 'zeros', where
the 'ones' correspond to pulses containing on average 20 photons and the 'zeros' to the arrival
of no photons, then there is a probability of $2 \times 10^{-9}$ that when a 'one' should be present, in
fact no photons at all arrive and the pulse is consequently mistaken for a 'zero'. No error
can be made in any of the 'zeros' since there can be no fluctuations in zero photons. Hence,
including both 'ones' and 'zeros', a signal containing an average of 10 photons per bit will
give rise to a BER of $10^{-9}$. With fewer than an average of 10 photons per bit the BER will be
worse (i.e. larger than $10^{-9}$); conversely, with more than 10 photons per bit the BER will
be better (i.e. smaller than $10^{-9}$). A BER of $10^{-9}$ has, in fact, become a fairly standard
requirement for optical communication systems. It should be noted that in the presence of even a
small amount of noise the average number of photons required per bit increases sharply (ref.
9.7); for example, with a noise signal equivalent to only one photon per bit the number of
signal photons per bit required to achieve a BER of $10^{-9}$ increases from 10 to about 22.
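
The quantum limit argument can be checked with a few lines of code. The sketch below evaluates eq. (9.20) and the resulting bit error rate for an ideal noise-free receiver, assuming equal numbers of 'ones' and 'zeros' and that errors occur only when a 'one' delivers no photons; the function names are illustrative.

```python
import math


def poisson_probability(n: int, mean: float) -> float:
    """Probability of detecting n photons when the mean number is `mean`, eq. (9.20)."""
    return mean**n * math.exp(-mean) / math.factorial(n)


def quantum_limit_ber(avg_photons_per_bit: float) -> float:
    """BER of an ideal receiver with equal numbers of 'ones' and 'zeros'.

    A 'one' carries twice the average number of photons per bit and a 'zero'
    carries none, so the only errors are 'ones' that arrive empty.
    """
    mean_photons_in_one = 2.0 * avg_photons_per_bit
    return 0.5 * poisson_probability(0, mean_photons_in_one)


if __name__ == "__main__":
    print(poisson_probability(0, 20))  # chance that a 20-photon pulse arrives empty
    print(quantum_limit_ber(10))       # resulting BER at 10 photons per bit
```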

Remarkably perhaps, practical systems can approach to within an order of magnitude or
so of this fundamental limit of 10 photons per bit. That is, they can achieve a BER of $10^{-9}$
whilst operating with an average of 100 photons per bit. Figure 9.19 shows examples of
typical detector performances that have been achieved in practice.

The above discussion applies only to amplitude-modulated direct detection signals; other
schemes have differing minimum signal requirements. Table 9.2 shows the number of
photons per bit required for the coherent detection schemes discussed in sections 9.1.1 and
9.3.9 (see ref. 9.8).

At first sight the results in Table 9.2 seem rather puzzling, since it would seem that
incoherent direct detection is almost the 'best' scheme available, and in view of the
complexities involved with the coherent schemes (further discussed in section 9.3.9) it would seem
rather pointless considering them at all. However, it has to be borne in mind that according
to eq. (9.3) the local oscillator effectively boosts the photogenerated current by the factor
$(P_{LO}/P_s)^{1/2}$ where $P_{LO}$ is the local oscillator power and $P_s$ is the signal power. This
effective gain acts, as in the case of the APD, to cause the dominant noise factor at the output of
the detector to be shot noise rather than Johnson noise. Thus for coherent systems the values
in Table 9.2 represent much more realistically achievable values than is the case for
incoherent detection.



9.3.3.4 Amplifier front end design 

We have seen in the previous section that the characteristics of the first stage of the
amplifier immediately following the detector can have a decisive effect on the ultimate
performance of the detector/amplifier combination. In this section, we consider in more detail
several ways in which this first stage (the amplifier 'front end') may be implemented.

We start with the so-called 'low impedance' front end. Referring to Fig. 9.17, the current
source (i.e. the photogenerated current) 'sees' the diode capacitance ($C_j$) and load resistor
($R_L$) acting in parallel. With an amplifier connected across $R_L$, the bias resistor ($R_a$)
associated with the amplifier first stage will also be in parallel with $R_L$. The effective load
resistance ($R_l$) is then given by $R_l = R_L R_a/(R_L + R_a)$.

To obtain maximum signal bandwidth (see the arguments leading to eq. 7.35) both $R_L$ and
$R_a$ should be as small as possible. The major penalty to be paid with such a design is that
the system is then likely to be dominated by Johnson noise (see eq. 9.14), and the S/N ratio
correspondingly poor. If the noise limitations are not a problem, however, then this design
has the advantage of relative simplicity. The circuit is usually implemented using a bipolar
junction transistor (see e.g. Fig. 9.20a, which shows a grounded emitter configuration).

If a low noise level is of paramount importance, then a 'high impedance' front end may
be considered, that is $R_a$ is made relatively large. The disadvantage is then that since the
electrical bandwidth is limited to $(2\pi R_l C_j)^{-1}$, the response will fall off at high frequencies
as $1/f$, with the capacitor and parallel resistor combination acting to give an output
proportional to the integral of the signal current. Thus it is only possible to have an extended
frequency response by including a differentiating circuit (i.e. one whose output is proportional
to $f$) later on in the amplifier chain. One problem with this latter approach is that, although
it allows an increased bandwidth, it also restricts the available dynamic range, that is, the
ratio of maximum to minimum usable input signals. A possible circuit for a high impedance
front end is shown in Fig. 9.20(b). Here, a silicon field effect transistor (FET) is used in a
common source configuration. The silicon FET exhibits both very high input impedance and
low noise performance. One difficulty is that the gain of the device is rather limited: above
25 MHz the gain drops to values close to unity, and if modulation frequencies higher than
this are required then a bipolar transistor is more often used.

A considerable effort has also been put into the development of GaAs FETs, since they
can operate up to several gigahertz and thus offer an alternative to bipolar devices. There 
has been particular interest in hybrid integrated circuits utilizing p-i-n photodiodes with 
GaAs FET front ends. The integration technique enables stray capacitances to be reduced 
to very low values (e.g. approximately 0.5 pF). 

FIG. 9.20 Possible designs for front end circuitry: (a) a low impedance front end using a bipolar junction
transistor (the value of $R_a$ is relatively low); (b) a high impedance front end using an Si FET (the value of $R_a$
is relatively high).

A modification of the high impedance front end is the 'transimpedance' design. Here, a
high impedance amplifier is used in conjunction with negative feedback; that is, a portion
of the output signal is fed back into the input with its phase reversed (Fig. 9.21). If the gain
of the amplifier is $G$, then it is easy to show (Problem 9.13) that the magnitude of the output
can be written

$$V_{\rm out} = \frac{i_\lambda R_f}{(1 + 4\pi^2 f^2 C_j^2 R_f^2/G^2)^{1/2}} \qquad (9.21)$$

At relatively low frequencies the output of the detector is given by $i_\lambda R_f$ where $i_\lambda$ is the
photogenerated current and $R_f$ the value of the feedback resistor. At higher frequencies the output
will decline with increasing frequency, the bandwidth being given by $G/(2\pi C_j R_f)$ where $C_j$
is the photodiode junction capacitance. As far as Johnson noise is concerned then it can be
shown (ref. 9.9) that the effective noise resistance is approximately $R_f$. Thus if $R_f$ is large
the circuit will exhibit low Johnson noise; at the same time, provided that $G$ remains high
at high frequencies, the frequency bandwidth will also be relatively large thereby reducing
or even eliminating the need for any differentiating circuitry.
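
A short numerical sketch of eq. (9.21) makes the bandwidth statement concrete. The component values used below (a 100 kΩ feedback resistor, 1 pF junction capacitance and an amplifier gain of 100) are hypothetical, chosen only for illustration, and the amplifier gain is assumed to be independent of frequency.

```python
import math


def transimpedance_output(i_signal, freq_hz, r_feedback, c_junction, gain):
    """Magnitude of the output voltage of a transimpedance front end, eq. (9.21)."""
    rolloff = 2 * math.pi * freq_hz * c_junction * r_feedback / gain
    return i_signal * r_feedback / math.sqrt(1 + rolloff**2)


def transimpedance_bandwidth(r_feedback, c_junction, gain):
    """Frequency at which the output has fallen by a factor sqrt(2): G/(2 pi C R_f)."""
    return gain / (2 * math.pi * c_junction * r_feedback)


if __name__ == "__main__":
    r_f, c_j, gain = 1e5, 1e-12, 100.0  # hypothetical component values
    bandwidth = transimpedance_bandwidth(r_f, c_j, gain)
    print(f"bandwidth = {bandwidth / 1e6:.0f} MHz")
    print(transimpedance_output(1e-6, 0.0, r_f, c_j, gain))
    print(transimpedance_output(1e-6, bandwidth, r_f, c_j, gain))
```

Compared with the same resistor used as a simple load, for which the bandwidth would be $(2\pi C_j R_f)^{-1}$, the feedback arrangement extends the bandwidth by the factor $G$.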

FIG. 9.21 Design for a transimpedance front end using an FET operational amplifier. At low modulation
frequencies the voltage output is given by $V_{\rm out} = i_\lambda R_f$, where $R_f$ is the feedback resistor and $i_\lambda$ the
photogenerated current.

9.3.3.5 Detector materials

The earliest detectors to be developed for use in fiber communication systems were based
on silicon, which limited the operating wavelength to a maximum of 1.1 µm. As we have
seen in section 9.3.1, it is advantageous from the point of view of both fiber loss and
material dispersion to use wavelengths around 1.3 µm or 1.55 µm. Germanium detectors, which
respond out to 1.9 µm, are possible candidates. Unfortunately these suffer from much higher
dark currents than do the corresponding silicon devices. In addition, germanium has a carrier
ionization ratio, $r$, of about unity. This implies that the optimum gains in APDs are relatively
small and hence also any improvements in S/N ratio. For example, taking $r = 1$ in eq. (7.38a)
in conjunction with eq. (9.19) gives an optimum gain of 15 (see Problem 9.11). This implies
an improvement in the optical power S/N ratio of only 8.9 (i.e. 9.5 dB).

As discussed in section 7.3.6.3 various types of detector can also be made from
materials based on GaInAs and GaInAsP which have proved eminently successful for use in the
1.3 µm to 1.55 µm wavelength region.

9.3.4 Fiber choice

As we have seen in Chapter 8, there are three main types of fiber available, namely step index
multimode, graded index multimode and single mode fibers. Step index multimode fibers
are relatively inexpensive, have large NA values, but suffer from intermodal dispersion.
Graded index fibers show greatly reduced intermodal dispersion but have relatively small
NA values, and can carry only about one-half of the energy that can be carried by a step index
fiber with the same core diameter. By their very nature single mode fibers are not subject
to intermodal dispersion, instead their bandwidth performance is limited by material
dispersion. They have such small diameters that usually only laser sources can couple
sufficient amounts of power into them; in addition, a much higher accuracy is required in
alignment at splices to avoid excessive jointing loss.

For many applications, a key consideration is the maximum length possible between
transmitter and receiver (or between repeaters). Two factors influence this, namely fiber
attenuation and fiber dispersion. In low bandwidth systems, the former is usually the limiting
factor, whereas in high bandwidth systems it is more likely to be the latter. For example,
suppose we consider a fiber with an attenuation of 5 dB km$^{-1}$; if a signal loss of some 40 dB
can be tolerated, then a cable length of 40/5 or 8 km can be used. If the cable were of the
step index type, however, with a bandwidth for 1 km of 20 MHz (i.e. a bandwidth-distance
product of 20 MHz km), and a signal bandwidth of 10 MHz were required, then the maximum
fiber length would be restricted to 20/10, that is 2 km. Figure 9.22 illustrates the effects of
these factors on the maximum fiber lengths achievable for several fiber loss and dispersion
values.
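
The two limits in this example can be combined in a few lines. The sketch below assumes, as the example does, that the usable bandwidth scales inversely with fiber length (a constant bandwidth-distance product) and that the loss budget is consumed by fiber attenuation alone; the function names are illustrative.

```python
def attenuation_limited_length_km(loss_budget_db: float, attenuation_db_per_km: float) -> float:
    """Longest fiber for which the total loss stays within the budget."""
    return loss_budget_db / attenuation_db_per_km


def dispersion_limited_length_km(bandwidth_distance_mhz_km: float, signal_bandwidth_mhz: float) -> float:
    """Longest fiber that still supports the required signal bandwidth."""
    return bandwidth_distance_mhz_km / signal_bandwidth_mhz


def max_link_length_km(loss_budget_db, attenuation_db_per_km,
                       bandwidth_distance_mhz_km, signal_bandwidth_mhz):
    """The usable length is set by whichever limit is reached first."""
    return min(
        attenuation_limited_length_km(loss_budget_db, attenuation_db_per_km),
        dispersion_limited_length_km(bandwidth_distance_mhz_km, signal_bandwidth_mhz),
    )


if __name__ == "__main__":
    # Values from the example: 40 dB budget, 5 dB/km, 20 MHz km, 10 MHz signal.
    print(attenuation_limited_length_km(40, 5))
    print(dispersion_limited_length_km(20, 10))
    print(max_link_length_km(40, 5, 20, 10))
```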



FIG. 9.22 Maximum repeater spacing allowing for differing fiber attenuations and dispersions as a
function of bit rate. A total loss of 50 dB at 1 Gbps has been assumed. The attenuation-limited lines are slightly
inclined owing to reduced receiver sensitivity with increasing bit rate.

9.3.5 Optical amplifiers

At the start of section 9.3 we mentioned that over relatively long fiber links one or more
repeater units may be required periodically to boost the signal. Such units are relatively
complicated in their electrical circuitry requirements and can constitute a considerable portion
of the cost of a long distance route. A much more elegant solution would be to develop an
'all optical' amplifier that did not require the optical signal to be turned into an electrical
signal and then back into an optical one. The process of stimulated emission provides a
mechanism whereby this may be achieved, since, bereft of its mirrors, a laser is simply an optical
amplifier. In section 5.10.1 the fiber laser was briefly mentioned and is easily adapted to
become an optical fiber amplifier. It is obviously important that such amplifiers operate at
wavelengths of 1.3 µm and 1.5 µm to be compatible with existing optical fiber
communication systems. Although operation at 1.3 µm has proved somewhat problematical, amplifiers
based on erbium-doped fibers, which exhibit gain over a range of wavelengths centred on
1.55 µm, have been developed successfully. A key factor in this was the discovery that it
was possible to incorporate relatively high concentrations of erbium into the fiber core (i.e.
up to 1000 parts per million) when co-doping with alumina ($\mathrm{Al_2O_3}$) is carried out.

The energy level diagram for the $\mathrm{Er^{3+}}$ ion in silica is indicated in Fig. 9.23. The first excited
state ($^4I_{13/2}$) has a relatively long lifetime and gain is readily obtained once sufficient
population inversion has been achieved between it and the ground state ($^4I_{15/2}$). (The system thus
behaves as a 'three-level system'.) Because the ions are incorporated into an amorphous solid
matrix the energy levels are considerably broadened (section 5.10.1) and gain may be
obtained over a range of some 30-40 nm. One big advantage is that, unlike the repeater unit,
the operation of the device is independent of both the modulation coding format and (at least
up to 100 Gbps) of the bit rate.

FIG. 9.23 The energy levels for $\mathrm{Er^{3+}}$-doped silica fiber which are involved in the amplification of light over
the wavelength range 1520 nm to 1560 nm using stimulated emission.

Many of the excited state levels can be used for pumping the laser, including the $^4I_{13/2}$ state
itself. For the present application, it is highly desirable that the pump source is compact, highly
efficient and capable of coupling optical power into the fiber with minimal loss. All these
requirements indicate the use of semiconductor lasers which limits the pumping wavelengths
that can be used to 807 nm, 980 nm and 1.480 µm. Of these the 980 nm is the most
effective, with small signal gains as high as 45 dB and incremental gains of 10 dB mW$^{-1}$ of pump
power having been achieved. Total output powers of up to 500 mW are also possible. By
comparison the 807 nm pump wavelength suffers because atoms in the $^4I_{13/2}$ excited state can be
excited by the pump wavelength to a yet higher state ($^2H_{11/2}$) thus reducing the amount of
population inversion available. To generate a reasonably uniform gain across the fiber profile
it is desirable that the fiber be single mode at the pump wavelength. This is obviously easier
to achieve when the pump wavelength is 1.48 µm rather than 980 nm or shorter wavelengths.
The efficiency is found to increase when the doping ions are concentrated in the centre of
the core rather than being uniformly distributed.



Figure 9.24 illustrates a typical gain curve of a length of $\mathrm{Er^{3+}}$-doped silica fiber. Ideally the
gain curve should be flat so that all signals within the gain profile will be amplified by the same
amount; in this case this is clearly not so. The degree of uniformity in the gain is affected by
a number of parameters including the concentrations of the active ion. It is possible to achieve
a reasonably flat gain profile (or even one which allows for the variation in fiber attenuation
with wavelength) by inserting a suitably designed optical filter into the system.

FIG. 9.24 Variation of gain with wavelength for a typical erbium-doped fiber amplifier. The results at
several different pump powers with a wavelength of 1485 nm are illustrated.

The physical layout of an erbium fiber amplifier is shown in Fig. 9.25. A means has to
be found to inject both the signal and the pumping radiation into the core of the doped fiber
section. This is normally achieved by means of a biconical fused fiber coupler (section 8.5.2).
In this device the fractional amount of radiation coupled from one input fiber to the 'other'
output fiber varies with the coupling length $z$ as $\sin^2(Cz)$ (eq. 8.43). Since the value of $C$
depends on wavelength and since pump and signal are at different wavelengths, it is
possible to have a situation where the two different input fibers both couple with nearly 100%
efficiency into the same output fiber as illustrated in Fig. 9.26 (this is not such an easy matter
when both pump and signal have nearly the same wavelength, as, for example, when using
the 1.48 µm pumping wavelength). When high output powers are required two pumps can
be used, one at each end of the fiber. One of the pump beams will then be propagating along
with the signal beam whilst the other will be propagating against the signal. This
arrangement ensures that the population inversion, and hence gain seen by the signal, remains
approximately constant along the fiber. It is usually necessary to prevent the pump radiation from
entering the main fiber after the amplifier (or the section before the amplifier if a
counter-propagating beam is also being used). It is also vital that optical feedback into the gain section
from reflections outside be eliminated, since such feedback can give rise to unwanted laser
action. Consequently polarization-insensitive optical isolators are often placed at both ends
of the gain fiber. It should also be noted that in the absence of optical pumping the
'amplifier' will become highly absorbing so that backup pumping lasers should always be on hand
in the event that the original pump lasers fail.
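
The way a fused coupler can combine two wavelengths follows directly from the $\sin^2(Cz)$ dependence. In the sketch below the coupling coefficients are hypothetical: the signal coefficient is simply assumed to be twice the pump coefficient, so that at the first length where the pump has crossed completely the signal has crossed over and back again. Real coefficients depend on the coupler geometry and wavelength and are not given in the text.

```python
import math


def coupled_fraction(coupling_coefficient: float, length: float) -> float:
    """Fraction of the input power transferred to the other guide: sin^2(C z)."""
    return math.sin(coupling_coefficient * length) ** 2


def first_full_transfer_length(coupling_coefficient: float) -> float:
    """Shortest coupling length giving complete transfer, C z = pi / 2."""
    return math.pi / (2 * coupling_coefficient)


if __name__ == "__main__":
    c_pump = 1.0             # hypothetical coupling coefficient, per mm
    c_signal = 2.0 * c_pump  # assumed ratio, chosen for illustration
    z = first_full_transfer_length(c_pump)
    print(coupled_fraction(c_pump, z))    # pump: essentially all transferred
    print(coupled_fraction(c_signal, z))  # signal: essentially none transferred
```

With the pump launched into one guide and the signal into the other, terminating the coupler at this length leaves both wavelengths in the same output guide.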

FIG. 9.25 The physical layout of an erbium-doped fiber amplifier; in this case the pump and signal beam
are moving in opposite directions. A filter is inserted to prevent the small amount of pump radiation
remaining after the gain section from propagating further along the fiber. In addition there are optical
isolators at either end of the amplifier; these are to prevent any power at the signal wavelength from being
reflected back into the gain section which could result in the device acting as a laser. More powerful
pumping can be achieved with two pump beams moving in opposite directions.

FIG. 9.26 Illustration of how a fiber coupler may be used as a mixer for two different wavelengths $\lambda_1$ and
$\lambda_2$. We assume that at the input to the mixer the power at wavelength $\lambda_1$ is in guide 1 whilst guide 2 contains
wavelength $\lambda_2$. The amount of optical power remaining in the initial guide is shown as a function of
coupling distance for the two wavelengths. After a distance $z_m$ the power at wavelength $\lambda_1$ has transferred
completely from guide 1 to guide 2 whilst that at wavelength $\lambda_2$ is all in guide 2. Thus if the coupler is
terminated at this point the two inputs will be mixed together.

As far as amplification at a wavelength of 1.3 µm is involved the situation is not so clear
cut. It is possible to use $\mathrm{Nd^{3+}}$-doped silica core fibers but efficiencies are low because of
the presence of excited state absorption and competing radiative transitions. Praseodymium
has also been tried as a dopant but appears to function best in host glasses such as 'ZBLAN'
($\mathrm{ZrF_4}$-$\mathrm{BaF_2}$-$\mathrm{LaF_3}$-$\mathrm{AlF_3}$-NaF) that are difficult to splice onto silica fibers.

Although the $\mathrm{Er^{3+}}$-doped fiber amplifier has proved very successful there are others
that have been proposed for use as optical amplifiers; one of the most important of these is
the semiconductor laser amplifier. If the end facets of the semiconductor laser are suitably
coated to reduce reflection then the device will function as an amplifier. The advantages of
the semiconductor laser amplifier are a much greater freedom in the choice of operating
wavelengths, which can be varied by varying the composition (and hence bandgap) of the
semiconductor, and a broader and smoother gain curve. The main disadvantages are that the gain
is dependent on the direction of polarization and that coupling losses between fibers and the
amplifier can be significant unless great care is taken. It is possible that semiconductor laser
amplifiers will find greater use in integrated optical circuits as discussed in section 9.4.



9.3.6 System design considerations

In this section we briefly consider typical component combinations used in direct detection
systems. Three main factors are paramount in influencing system choice, namely bandwidth,
maximum transmission distance and, of course, total system cost.

For short distances (i.e. up to a few hundred metres) and for bandwidths of up to a few
tens of megahertz a cost-effective combination is a red-emitting LED, a high NA all-plastic
fiber and an Si photodiode detector. More exacting distance and bandwidth requirements may
be met with a move to silica-based fibers with an LED (or laser) source operating at about
850 nm. The detector may be either a silicon photodiode or a silicon avalanche photodiode.

When the transmission rate exceeds 400 Mbps or so, the usable fiber length is determined
primarily by fiber dispersion rather than by fiber attenuation (see Fig. 9.22). It then becomes
necessary to move to single mode fibers and operating wavelengths close to the material
dispersion minimum (i.e. near 1.3 μm in silica-based fibers). Links carrying several gigabits
per second over fibers of several tens if not hundreds of kilometres in length have already
been demonstrated or are being built.

The development of the erbium optical fiber amplifier has made 1.55 μm an attractive
wavelength to work at, with the added advantage that fiber losses (in dB km⁻¹) are about a
factor of two smaller than at a wavelength of 1.3 μm. The downside to this is, of course,
that material dispersion will be larger. There are two ways round this. First, if the
installation is a new one then dispersion-shifted (or dispersion-flattened) fiber can be used.
Secondly, if a 'standard' fiber with a dispersion minimum at 1.3 μm is involved then a
dispersion compensation scheme may be employed. In this a length of fiber which exhibits a high
dispersion in the opposite sense to that of the main fiber is inserted just before the receiver.
Provided the length has been chosen correctly the dispersion in the main fiber may be cancelled
out by the dispersion in the opposite sense introduced by the compensation fiber to produce a
'dispersionless' link. Because the link now involves a longer fiber the total losses will be
increased, but this increase need not be excessive since the fiber is operating at the
wavelength of lowest loss.
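
The length of compensating fiber follows from requiring the accumulated dispersion of the two
fibers to cancel, that is D_main L_main + D_comp L_comp = 0. The short Python sketch below works
this through. The dispersion coefficients, link length and loss figure are illustrative
assumptions rather than values taken from the text, and the function name is ours.

```python
def compensating_length_km(d_main, l_main_km, d_comp):
    """Length of compensating fiber (km) that cancels the dispersion of the main fiber.

    d_main, d_comp: dispersion coefficients in ps nm^-1 km^-1 (opposite signs required).
    """
    if d_main * d_comp >= 0:
        raise ValueError("compensating fiber must have dispersion of opposite sign")
    return -d_main * l_main_km / d_comp


# Assumed, illustrative values (not from the text):
D_MAIN = 17.0      # 'standard' fiber at 1.55 um, ps/(nm km)
D_COMP = -85.0     # compensating fiber, ps/(nm km)
L_MAIN = 100.0     # main link length, km
LOSS_COMP = 0.5    # attenuation of the compensating fiber, dB/km

l_comp = compensating_length_km(D_MAIN, L_MAIN, D_COMP)
residual = D_MAIN * L_MAIN + D_COMP * l_comp    # net dispersion, ps/nm
extra_loss = LOSS_COMP * l_comp                 # added attenuation, dB

print(f"compensating length = {l_comp:.1f} km")
print(f"residual dispersion = {residual:.1f} ps/nm")
print(f"extra loss          = {extra_loss:.1f} dB")
```

With these assumed figures the extra loss is modest compared with the loss of the main link,
which is the point made above.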

To determine whether or not a chosen system will perform satisfactorily, a number of 
checks must be made. Obviously both detector and emitter must be capable of handling the 
required signal bandwidth. The fiber dispersion over the length required must not degrade 
the signal excessively, while for a given BER (for a digital signal) there will be a minimum 
average signal power that must reach the detector. If the power launched into the fiber is 
known, together with the fiber attenuation, the maximum length of fiber that can be used 
may be calculated. Allowance must be made for any splices or joins, and a safety margin 
of, say, 5 dB is also usually included. Such a calculation is called a flux budget (see Example 
9.2). It is customary to use a logarithmic power unit called the dBm, where the power is
referred to 1 mW; thus a power of 10 mW becomes 10 log₁₀(10/1) or 10 dBm.

EXAMPLE 9.2 Flux budget calculation

Suppose we have to construct a link of length 15 km and bandwidth 100 Mbps.
Components are chosen with the following characteristics: receiver sensitivity
-50 dBm (at 100 Mbps), fiber loss 2 dB km⁻¹ and transmitter launch power (into fiber)
0 dBm. It is anticipated that in addition 10 splices will be required, each involving a
loss of 0.5 dB. The following table itemizes each loss term and allows a ready
determination of whether the system should operate with sufficient power margin:

Transmitter output                                                     0 dBm
Receiver sensitivity                                                 -50 dBm
Available margin                                                      50 dB

System loss:
Fiber loss (2 dB km⁻¹ × 15 km)                                        30 dB
Detector coupling loss                                                 1 dB
Total splicing loss (0.5 dB × 10)                                      5 dB
Headroom for temperature range, ageing effects and future splices     5 dB
Total attenuation                                                     41 dB

Hence, the excess power margin is 50 - 41 = 9 dB, which should be sufficient for the
operation of the link.
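
The bookkeeping of a flux budget is easily mechanized. The following Python sketch repeats the
arithmetic of Example 9.2 and also estimates the longest fiber the budget would allow. The
function names are ours, and the launch power (0 dBm) and detector coupling loss (1 dB) are
taken as the values implied by the 50 dB margin and the 41 dB total attenuation.

```python
import math


def mw_to_dbm(power_mw):
    """Convert a power in milliwatts to dBm (decibels referred to 1 mW)."""
    return 10.0 * math.log10(power_mw / 1.0)


def flux_budget(launch_dbm, sensitivity_dbm, losses_db):
    """Return (available margin, total loss, excess margin), all in dB."""
    available = launch_dbm - sensitivity_dbm
    total_loss = sum(losses_db.values())
    return available, total_loss, available - total_loss


FIBER_LOSS_DB_PER_KM = 2.0
losses = {
    "fiber": FIBER_LOSS_DB_PER_KM * 15,   # 15 km link
    "detector coupling": 1.0,             # assumed, see lead-in
    "splices": 0.5 * 10,                  # 10 splices at 0.5 dB each
    "headroom": 5.0,                      # safety margin
}

available, total, excess = flux_budget(0.0, -50.0, losses)

# Longest fiber for which the excess margin just falls to zero.
fixed_losses = total - losses["fiber"]
max_length_km = (available - fixed_losses) / FIBER_LOSS_DB_PER_KM

print("10 mW in dBm :", mw_to_dbm(10.0))
print("available    :", available, "dB")
print("total loss   :", total, "dB")
print("excess margin:", excess, "dB")
print("max length   :", max_length_km, "km")
```

Keeping the loss terms in a dictionary makes it simple to add further items, such as
connector losses, without altering the calculation.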



In Fig. 9.27 we show typical launch powers for both LED and laser sources and also the
received powers required by state-of-the-art receivers to achieve a 10⁻⁹ BER as a function
of bit rate. It is obvious from Fig. 9.27 that as the required bit rate increases the relative
advantage of the laser/APD combination also increases.

Some indication of the overall performance of an optical fiber communication system can
be obtained from the so-called 'eye diagram' (Fig. 9.28b). This is the pattern obtained on an
oscilloscope when the sweep rate of the oscilloscope is set at a fraction of the bit rate, and the
signal consists of a random sequence of ones and zeros. In an ideal system, where the received
pulses were almost completely rectangular, the pattern would resemble Fig. 9.28(a), giving
an open eye pattern. However, the received pulses will be broadened by dispersion and by the
limited receiver bandwidth, and subsequent pulses will also be subject to a slight random
displacement in time ('jitter'); thus the eye will be 'closed' to some extent (as in Fig. 9.28b).
The decision time and threshold level will be set with relation to the centre of the eye and it
is evident therefore that the extent of eye closure will give an indication of the error rate.

FIG. 9.27 Simple illustration of the system flux budget for representative systems. The upper
shaded areas show typical launch powers from LED and laser sources, whilst the lower curves show
typical receiver sensitivities for a 10⁻⁹ bit-error rate at 1.3 μm wavelength. The curves for the
p-i-n and APD detectors are for direct detection; a further curve shows results for coherent
detection. The fundamental detection limit corresponding to 10 photons per bit is also shown.

FIG. 9.28 (a) An idealized eye diagram where the pulses exhibit little dispersion and jitter.
The decision time and decision levels are shown. (b) A typical eye diagram as obtained in practice.

So far in this section we have not discussed the use of erbium-doped fiber amplifiers 
(EDFAs). Over long distances they (together with repeater units) can be used to boost the 
signal periodically so that on arrival at the detector it is strong enough to give an acceptable 
BER (or S/N ratio). Optical amplifiers can be sited at three different positions in the optical 
fiber link to serve different purposes. First they can be placed immediately before the first 
section of optical fiber where they act as power amplifiers, the advantage being that only a 
low power source laser is required. Although direct modulation of high power lasers is
possible they are prone to exhibiting a wavelength 'chirp', that is a change in emission
frequency throughout the emitted pulse as described at the end of section 9.3.2. This has the
effect of increasing the effective source linewidth and so limiting the system bandwidth
because of increased material dispersion.

Another position for the amplifier is just before the receiver. This boosts the incoming 
signal and provides increased signal sensitivity; here only a low signal power is involved, 
in contrast to the previous usage. It is of interest to note that a fiber amplifier p-i-n receiver 
combination can achieve sensitivities which almost equal those offered by coherent detec- 
tion systems. 

Finally the optical amplifier can take the place of the repeater unit. It should be
remembered that the optical amplifier can only amplify; it cannot reshape or retime the optical
pulses. Furthermore the amplifiers do not provide noise-free amplification. Whenever the
amplifier is being pumped there will be some spontaneous emission generated, most of which will
lie within the gain bandwidth. Some of this radiation will propagate down the fiber along
with the signal and will be amplified at each subsequent amplifier. This will degrade the
system performance in two ways: first the S/N ratio will worsen after each amplifier, and
secondly the amplified spontaneous emission will serve to saturate amplifiers further down
the chain thus limiting the available gain.
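
The way in which the S/N ratio worsens along a chain of amplifiers can be illustrated with a
rough estimate. The Python sketch below uses the standard expression for the amplified
spontaneous emission power added by one amplifier, 2 n_sp hν (G - 1) Δν, and assumes that each
amplifier exactly restores the signal to its launch level so that the noise contributions simply
add. Neither the expression nor the numerical values (gain, n_sp, optical bandwidth, signal
power) come from the text; they are assumptions chosen for illustration, and gain saturation is
ignored.

```python
import math

H = 6.626e-34    # Planck constant, J s
C = 2.998e8      # speed of light, m/s


def ase_power_w(gain_db, wavelength_m, bandwidth_hz, n_sp):
    """ASE power (both polarizations) added by one amplifier within bandwidth_hz."""
    gain = 10 ** (gain_db / 10)
    photon_energy = H * C / wavelength_m
    return 2 * n_sp * photon_energy * (gain - 1) * bandwidth_hz


def osnr_db(signal_w, n_amplifiers, ase_per_amp_w):
    """Optical S/N ratio after n amplifiers, each restoring the signal to signal_w."""
    return 10 * math.log10(signal_w / (n_amplifiers * ase_per_amp_w))


# Assumed, illustrative values (not from the text):
p_ase = ase_power_w(gain_db=10.0, wavelength_m=1.55e-6, bandwidth_hz=12.5e9, n_sp=1.5)

for n in (1, 10, 100):
    print(f"{n:4d} amplifiers: optical S/N = {osnr_db(1e-3, n, p_ase):.1f} dB")
```

On this simple model every tenfold increase in the number of amplifiers costs 10 dB of
optical S/N ratio.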

Finally in this section, to illustrate some of the rapid advances in technology that have
taken place in optical fiber communication links, we look briefly at three transatlantic
telecommunication links. TAT 8 was the first ever optical fiber transatlantic link and was
installed in 1988. Details are provided in Table 9.3. Also shown are details of TAT 9 and
TAT 12. The latter, when introduced, provided more capacity than all of the previous links
put together. It was also one of the first major systems to employ erbium-doped fiber
amplifiers (EDFAs) in place of repeaters. All three systems cover a distance of about 6000 km.

9.3.7 Local area networks

Although the initial drive to develop optical fibers has come mainly from the
telecommunications area, that is from the need to develop high bandwidth, long distance,
repeaterless point-to-point links, there are other types of link, for example the local area
network (LAN), to which optical fibers may make significant contributions. The LAN concept is
difficult to specify exactly, but aims to provide a communication network between a group of
users within a restricted geographical area (e.g. within a factory or group of offices). Typical
maximum distances involved might be a few kilometres. Local networks involving telephone
links are nothing new, of course, but the availability of much increased bandwidths enables
several other types of communication to be contemplated, for example data communication
and even video links. Situations can easily be envisaged where networks linking up to, say,
100 users with data rates of up to several megabits per second may be required.

One of the main problems with realizing such networks using optical fibers is that the greatest
loss the system can readily tolerate between launch and detection is about 40-50 dB (see
Fig. 9.27). With coaxial cable, the corresponding figure is nearly 90 dB. In long haul
communication links, the advantage this gives to coaxial links is more than offset by their
relatively high transmission losses. For the comparatively short distances involved in typical
LANs, however, coaxial links do not have this built-in disadvantage. Furthermore, any such
system will inevitably involve a large number of 'taps' and switches. For an optical system
these components, at present, involve appreciable inherent losses, whilst the corresponding
losses in copper wire technology are negligible. Nevertheless, fiber systems do offer the
advantages of high bandwidth, immunity from local interference, a higher level of
information security and small size and weight of the cable.

There are many ways in which the interconnections in LANs may be made, with a trade-off
occurring between such factors as system complexity (and hence cost) and flexibility.
Figure 9.29 illustrates several possible topologies for such systems. In Fig. 9.29(a), the 'fully
connected' network, every potential user (or 'node') is connected to every other. This
arrangement is complex, expensive and needs considerable rewiring if more nodes are added. Some
of the links may never be used. It is, however, the most reliable topology, since if one link
is damaged then only this one linkage is affected, and rerouting may be possible to restore
the connection.

TABLE 9.3 Some recent transatlantic telecommunications systems

System    Year    Wavelength (μm)    Bit rate    Repeater/EDFA spacing
TAT 8     1988    –                  200 Mbps    50 km (repeater)
TAT 9     1989    1.55               565 Mbps    100 km (repeater)
TAT 12    1995    1.55               5 Gbps      45 km (EDFA)

FIG. 9.29 Three possible topologies for a local area network: (a) fully connected, (b) star, and (c) data bus.

Figure 9.29(b) illustrates the 'star' network. Here, each node is connected to a central
'control' node. If this is 'active', each signal can be switched to its required destination. If
'passive', a particular node will direct a signal to all nodes, each of which will then have to
decide for itself whether or not to accept the signal. The number of links is reduced to a
minimum, but the reliability of the whole system hinges on that of the central node. If this
fails in some way, the entire system is likely to break down. Figure 9.29(c) illustrates the
'data bus'; this offers great flexibility and is efficient in its use of fiber.

Some of the optical components likely to be useful in LANs have been touched upon in
section 8.5.2. For example, the 'mixing rod' type of fiber coupler could form the basis of
the central (passive) node of a star network. Similarly, the fused biconical coupler could be
designed to provide a small amount of coupling from one fiber into another, thus providing
a 'tap' as required in a data bus configuration.

The diversity of possible systems and their realization make any comprehensive discus- 
sion here impossible, and the interested reader may consult the texts of ref. 9.10 for further 
information. 



9.3.8 Wavelength division multiplexing 

One way of increasing the bandwidth of an existing optical link with no modification to the
fiber itself is to employ the technique of wavelength division multiplexing (also known as
optical frequency division multiplexing). In this, signals on different carrier wavelengths are
mixed together (i.e. multiplexed) and then transmitted simultaneously down a fiber. This
technique has the obvious advantage of increasing the capacity of an optical fiber system
without changing the fiber. It can also play an important role in networks. For example,
consider a number of users connected to a central hub in the star network configuration of Fig.
9.29(b). Each user is assigned a particular transmission wavelength and the hub site
transmits all the received messages to the other sites, who can then decide which user they wish
to be connected to.

The signal multiplexing may be carried out using some type of mixer such as the 3 dB
coupler (as discussed in section 8.5.2), although there will be significant losses associated
with this. A somewhat more difficult task is the wavelength separation at the detector end.
Again couplers may be used to split the signal up amongst the number of receivers required.
The individual wavelengths required by each receiver can then be isolated using some type
of narrow optical bandpass filter (usually of the multilayer dielectric type). Again there are
inevitable losses which become increasingly large when the number of separate wavelength
channels increases. Typical channel separations of 4 nm with cross-talk of better than 30 dB
may be obtained at 1.3 μm wavelength. The recent development of Bragg fiber grating
structures (section 8.7.5.2) has opened up the possibility of in-fiber wavelength-selective devices.
For example, the wavelength notch filter required for wavelength demultiplexing can be
obtained by using Bragg reflection gratings with staggered wavelengths such that there is
a transmission gap in their resonant spectra.
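
The growth of the splitting loss with the number of channels can be seen from a simple estimate:
an ideal splitter feeding N receivers delivers only 1/N of the power to each, a loss of
10 log₁₀ N dB. The Python sketch below tabulates this. The function names and the 2 dB filter
insertion loss are assumptions made for illustration and are not values taken from the text.

```python
import math


def splitting_loss_db(n_receivers):
    """Loss of an ideal 1-to-N power splitter: each output receives 1/N of the input."""
    return 10 * math.log10(n_receivers)


def demux_loss_db(n_receivers, filter_loss_db=2.0):
    """Splitter loss plus an assumed insertion loss for the bandpass filter."""
    return splitting_loss_db(n_receivers) + filter_loss_db


for n in (2, 4, 8, 16):
    print(f"{n:2d} channels: split {splitting_loss_db(n):5.1f} dB, "
          f"split + filter {demux_loss_db(n):5.1f} dB")
```

Each doubling of the number of channels adds a further 3 dB, quite apart from any excess
loss in real couplers.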

An alternative wavelength demultiplexing technique is to use some type of spatial
dispersion element such as a prism or (more usually) a diffraction grating. A schematic diagram
of such a system is shown in Fig. 9.30. Light from a single fiber is collimated with a lens
and then dispersed spatially using a blazed diffraction grating. The lens refocuses each
individual wavelength onto its own output fiber, each of which is connected to an individual
detector. Up to about 20 channels can be accommodated with channel separations of down
to 4 nm, insertion losses of about 5 dB and cross-talk between channels of 20 to 30 dB. More
details concerning possible wavelength multiplexing/demultiplexing techniques are given
in ref. 9.11.

FIG. 9.30 Diagram illustrating one method that can be used to separate a wavelength-multiplexed
signal into its two (or more) constituent wavelength signals. Light (here containing two
wavelengths, say λ1 and λ2) from the input fiber is collimated, and then falls onto an inclined
diffraction grating. This light is diffracted back but with different wavelengths being
diffracted at different angles. The lens refocuses the light on the fibers with different
wavelengths being focused onto the ends of different fibers. The different wavelengths are
then taken to the appropriate detector for that particular channel.

Although the idea of wavelength division multiplexing is attractive its implementation is 
not straightforward, one notable problem being the inevitable power loss that is involved 
in the multiplexing/demultiplexing process. Thus each individual channel has to have a sig- 
nificantly higher power than in a 'normal' single channel system. Where there is quite a large 
number of channels involved the fibers may be carrying a significant amount of total power, 
so much so that non-linear processes such as stimulated Raman scattering and stimulated 
Brillouin scattering (see section 10.1.4) become a possibility. In addition, if optical fiber
amplifiers are used to boost signal levels the relatively large amounts of total power involved
may cause the amplifiers to saturate and hence reduce the gain available. The problems
associated with amplified spontaneous emission will also be more severe than with single
wavelength systems. It should also be remembered that the gain is not flat over the gain profile
of the amplifier. The usual way of dealing with this is to use some form of compensating
filter; however, even small differences in gain can become significant after a number of
amplifications have taken place.

9.3.9 Coherent systems 

The basic theory behind the operation of coherent systems was discussed in section 9.1.1. 
The output signal from the fiber is mixed with that from a local oscillator (usually a
semiconductor laser) and the combined fields are then incident on the detector. In the case of
heterodyne detection, where the signal and local oscillator frequencies ($\omega_c$ and
$\omega_L$ respectively) are different, the output of the detector is given by

$$O_d = R A_c A_L \cos[(\omega_c - \omega_L)t + (\phi_c - \phi_L)] \qquad (9.3)$$

In the case of homodyne detection (where $\omega_c = \omega_L$) we have

$$O_d = R A_c A_L \cos(\phi_c - \phi_L) \qquad (9.4)$$

Homodyne detection is difficult because of the problem of ensuring that both the signal and
local oscillator frequencies remain equal, and it is rarely used in practice. Heterodyne detec- 
tion is certainly practicable, however, and it has two main advantages over direct detection. 
First it can operate with signals which are close to the quantum noise limit and secondly, 
by varying the frequency of the local oscillator, the receiver can 'tune in' to any signal fre- 
quency of interest provided it is within the tuning range of the local oscillator. It thus pro- 
vides a possible solution to the problem of separating individual signals from the group of 
signals involved when using wavelength division multiplexing. However, it has to be admit- 
ted that coherent detection involves a considerable increase in system complexity, and hence 
cost, over direct detection systems. 
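
The origin of the difference-frequency term, and the reason why a strong local oscillator helps
with weak signals, can be checked numerically. The Python sketch below adds two co-polarized
fields as complex phasors, forms the squared modulus that a square-law detector responds to, and
compares it with a constant term plus a beat term of the form appearing in eq. (9.3). The
amplitudes and frequencies are assumed values, and the factor of 2 in the beat term is taken
to be absorbed in the constant R.

```python
import cmath
import math


def detected_intensity(t, a_c, a_l, w_c, w_l, phi_c=0.0, phi_l=0.0):
    """|E_c + E_L|^2 for co-polarized signal and local oscillator phasors."""
    e_c = a_c * cmath.exp(1j * (w_c * t + phi_c))
    e_l = a_l * cmath.exp(1j * (w_l * t + phi_l))
    return abs(e_c + e_l) ** 2


def beat_term(t, a_c, a_l, w_c, w_l, phi_c=0.0, phi_l=0.0):
    """Difference-frequency term, of the same form as eq. (9.3)."""
    return 2 * a_c * a_l * math.cos((w_c - w_l) * t + (phi_c - phi_l))


# Assumed values: weak signal, strong local oscillator, 1 GHz intermediate frequency.
A_C, A_L = 0.01, 1.0
W_C = 2 * math.pi * 193.100e12
W_L = 2 * math.pi * 193.099e12

times = [k * 1e-11 for k in range(200)]    # 2 ns of samples
errors = [
    abs(detected_intensity(t, A_C, A_L, W_C, W_L)
        - (A_C**2 + A_L**2)
        - beat_term(t, A_C, A_L, W_C, W_L))
    for t in times
]
lo_gain = 2 * A_C * A_L / A_C**2    # beat amplitude relative to the direct-detection term

print("largest residue:", max(errors))
print("beat amplitude / direct-detection term:", lo_gain)
```

The residue is purely numerical, whilst the ratio shows that the beat term can be made much
larger than the signal would give on its own simply by increasing the local oscillator
amplitude.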

One of the obvious requirements is for an optical source (the local oscillator) with a
highly stable output with respect to both amplitude and frequency. In practical terms the
largest linewidths that can be used for both ASK and FSK are about 20 MHz and for PSK
about 1 MHz. The standard Fabry-Perot semiconductor laser has a linewidth of about
10 GHz (when directly modulated this may increase to 1000 GHz). It can be reduced in
DFB lasers (see section 6.2) to between 1 and 100 MHz (i.e. suitable for ASK and FSK),
and in large external cavity lasers to below 100 kHz (i.e. suitable for PSK). As far as tuning
in to closely spaced signals is concerned, field trials have indicated that it is possible to
have channels separated by about 10 GHz. This is certainly superior to typical channel
separations in normal wavelength division multiplexing schemes, where the channels usually have
to be at least 150 GHz apart (i.e. 1 nm apart in wavelength terms).
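
Channel spacings quoted in frequency and in wavelength are related, for small spacings, by
Δν = cΔλ/λ². The Python sketch below (function names ours) evaluates this at the two
wavelengths of interest, from which it can be seen that a spacing of 1 nm corresponds to between
about 125 and 180 GHz depending on the operating wavelength.

```python
C = 2.998e8    # speed of light, m/s


def spacing_ghz(delta_lambda_nm, wavelength_um):
    """Frequency spacing (GHz) for a small wavelength spacing: dv = c dlambda / lambda^2."""
    lam = wavelength_um * 1e-6
    return C * delta_lambda_nm * 1e-9 / lam**2 / 1e9


def spacing_nm(delta_nu_ghz, wavelength_um):
    """Wavelength spacing (nm) for a small frequency spacing."""
    lam = wavelength_um * 1e-6
    return delta_nu_ghz * 1e9 * lam**2 / C / 1e-9


for lam_um in (1.3, 1.55):
    print(f"at {lam_um} um: 1 nm = {spacing_ghz(1.0, lam_um):.1f} GHz, "
          f"10 GHz = {spacing_nm(10.0, lam_um):.3f} nm")
```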

Another factor to be taken into account is that of the stability of the polarization of the
signal within normal communications fibers. When considering coherent detection we have
assumed that the polarization state of the local oscillator is exactly the same as that of the
signal. Single mode fiber can support two orthogonally polarized modes and any outside
disturbances to the fiber, such as temperature or stress changes, can give rise to a varying
amount of coupling between these two (i.e. a type of mode coupling) and hence lead to
unpredictable changes in the polarization state of the signal. This in turn will cause signal
'fading' unless measures are taken to counter the problem (if the signal and local oscillator have
orthogonal polarizations then no signal at all will be received). The use of
polarization-maintaining fiber is one obvious solution, though it is considerably more costly
than standard fiber, and the fact that all the existing fiber links would have to be replaced
makes this solution a virtual non-starter. A more realistic approach is to track the state of
polarization at the receiver and then to change the state of polarization of the local oscillator
to match that of the signal. This is in fact a reasonable proposition since the changes in
polarization take place relatively slowly (over timescales of the order of seconds) and the light
from the local oscillator can have its polarization state changed by passing it through a length
of fiber which is subject to variable asymmetric stress. Figure 9.31 illustrates the schematic
layout of such a system. The two input signals are mixed together using a 3 dB coupler and the
radiation from both output fibers is utilized in what is termed a balanced receiver
configuration. It may be shown that the two outputs from the 3 dB coupler have a phase difference
of π. If, therefore, we take the difference between the outputs of the two detectors by having
them connected back to back as shown, we effectively double the signal that would be available
from each detector individually.
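
The doubling of the signal in the balanced receiver can be verified with a few lines of Python.
The sketch below models an ideal lossless 3 dB coupler in which the cross-coupled field acquires
a factor of i (a common convention, assumed here), and evaluates the two detector outputs for a
signal that is 90° out of phase with the local oscillator. The amplitudes and the unit
responsivity are arbitrary assumed values.

```python
import cmath
import math


def coupler_3db(e_signal, e_lo):
    """Ideal lossless 3 dB coupler; the cross-coupled field picks up a factor of i."""
    r = 1 / math.sqrt(2)
    return r * (e_signal + 1j * e_lo), r * (1j * e_signal + e_lo)


def detector_currents(phase, a_s, a_l, responsivity=1.0):
    """Photocurrents at the two coupler outputs for a given signal/LO phase difference."""
    out1, out2 = coupler_3db(a_s * cmath.exp(1j * phase), a_l)
    return responsivity * abs(out1) ** 2, responsivity * abs(out2) ** 2


A_S, A_L = 0.01, 1.0                  # assumed amplitudes (arbitrary units)
i1, i2 = detector_currents(math.pi / 2, A_S, A_L)
dc = (A_S**2 + A_L**2) / 2            # constant term seen by each detector

print("beat terms     :", i1 - dc, i2 - dc)    # equal size, opposite sign
print("balanced output:", i1 - i2)             # twice the single-detector beat term
```

Besides doubling the beat signal, taking the difference removes the large constant term due
to the local oscillator, together with any fluctuations it carries.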

Other techniques which can be used to deal with the polarization stability problem include
polarization diversity and polarization scrambling. In the former the incoming radiation is
split into two and each part falls onto a separate detector. The two detectors are connected
in a balanced receiver configuration and are set up so as to respond to light that is polarized
in orthogonal directions. Whatever the polarization of the incoming radiation one or both
of the detectors will be able to respond. In the polarization scrambling technique the
polarization of the source is deliberately varied so that it undergoes a complete rotation of the
polarization during the bit period. Thus there will always be, at some time during the bit
period, an output from the detector. Both these techniques have been demonstrated
successfully, but the latter incurs a 3 dB power penalty, that is twice as many photons per bit
are required to achieve the same BER as in the ideal coherent system.

At present the number of coherent systems in commercial use is insignificant. This is
mainly because the much simpler erbium fiber amplifier + p-i-n direct detection systems
are capable of detection sensitivities which are within a few decibels of those obtainable from
coherent systems (see Fig. 9.27). This has removed one of the main attractions of coherent
systems, though their superiority over direct detection in terms of channel spacing and tuning
range has not yet been utilized.

FIG. 9.31 Diagram showing the layout of a coherent receiver using a balanced receiver configuration.



9.3.10 Solitons 

One of the fundamental limitations on bandwidth in single mode optical fiber systems is due 
to material dispersion (section 8.3.5). It arises because different wavelengths of light travel 
at different velocities in a dispersive medium. From the discussion leading up to eq. (8.34) 
we may write 

$$\frac{d\tau}{d\lambda_0} = -\frac{L}{c}\,\lambda_0\,\frac{d^2 n}{d\lambda_0^2}$$

where $\tau$ is the time taken for radiation of wavelength $\lambda_0$ to traverse a distance L
of fiber. At a wavelength which is greater than the minimum dispersion wavelength, the quantity
$\lambda_0\,d^2n/d\lambda_0^2$ is negative (see Fig. 8.26) and hence $d\tau/d\lambda_0$ will be
positive. In this situation longer wavelengths will travel more slowly than shorter wavelengths
and any optical pulse will spread out in time as it travels down the fiber with the shorter
wavelengths being at the leading edge and the longer wavelengths at the trailing edge. However,
there is another mechanism present which, under certain circumstances, is capable of opposing
this pulse broadening. This arises because the electric field in the pulse causes a change in
the local refractive index via the Kerr effect (eq. 3.16). As a consequence a phase shift
develops at the leading edge of the pulse which causes the radiation there to move to lower
optical frequencies (i.e. a 'red shift' develops). Conversely at the trailing edge there is a
corresponding blue shift. This wavelength variation through the pulse is in the opposite sense to
that induced by material dispersion and hence there is the possibility that a non-dispersive
pulse can be obtained, provided that the magnitudes of these two dispersive effects can be made
equal. This can be shown to be possible provided that the pulse shape takes on a particular form,
and also that the maximum pulse power exceeds a particular threshold value (which is dependent on
both the loss within the fiber and the Kerr coefficient). Within a typical low loss silica fiber
the minimum pulse power required is of the order of a few tens of milliwatts with
corresponding pulse widths of a few tens of picoseconds. The required pulse shape is given by
(ref. 9.12)

$$I(t) = I_0\,\mathrm{sech}^2(t/T_s) \qquad (9.22)$$

where the parameter $T_s$ varies inversely as the square root of the peak power. It may be noted
that the total energy in such a pulse is $2I_0T_s$ whilst the pulse width is $1.76T_s$. Thus the
pulse width becomes narrower as the total energy and the peak power increase. The
non-dispersive pulse is known as a soliton.
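
The two numerical factors quoted for the sech² pulse are easily confirmed. The Python sketch
below integrates the profile of eq. (9.22) numerically to obtain the total energy, and finds the
full width at half maximum from the condition sech²(x) = 1/2. The peak irradiance and the value
of T_s are arbitrary assumed values, and the function names are ours.

```python
import math


def sech2(t, i0, t_s):
    """Irradiance I(t) = I0 sech^2(t / T_s)."""
    return i0 / math.cosh(t / t_s) ** 2


def pulse_energy(i0, t_s, half_span=20.0, n=20001):
    """Trapezoidal integral of I(t) from -half_span*T_s to +half_span*T_s."""
    a = -half_span * t_s
    h = 2 * half_span * t_s / (n - 1)
    total = 0.5 * (sech2(a, i0, t_s) + sech2(-a, i0, t_s))
    total += sum(sech2(a + k * h, i0, t_s) for k in range(1, n - 1))
    return total * h


def fwhm(t_s):
    """Full width at half maximum: sech^2(x) = 1/2 at x = arccosh(sqrt(2))."""
    return 2 * t_s * math.acosh(math.sqrt(2))


I0, T_S = 1.0, 10e-12    # assumed: unit peak irradiance, T_s = 10 ps
energy = pulse_energy(I0, T_S)

print("energy / (I0 T_s):", energy / (I0 * T_S))
print("width / T_s      :", fwhm(T_S) / T_S)
```

Since T_s varies as the inverse square root of the peak power, the energy 2I₀T_s varies as the
square root of the peak power, so that a more energetic soliton is also a narrower one.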

Once a soliton has formed it tends to be remarkably stable, passing splices etc. with only
momentary disturbances and adapting its shape to suit the local conditions. However, because
of fiber attenuation, as the pulse progresses it will gradually lose energy, and hence broaden
(Fig. 9.32). When the pulse energy falls below the threshold value then the pulse loses its
soliton characteristics. However, optical amplifiers (as described in section 9.3.5 above) can
be used to restore the pulse energy when it nears the threshold value.

Transmission of soliton pulses at data rates of several gigabits per second over several
thousands of kilometres of fiber has been demonstrated on a number of occasions (see e.g.
ref. 9.13). So far soliton transmission remains a very interesting possibility; whether it will
be used seriously for high bit rate and very long distance transmission remains to be seen,
although it is an obvious method of overcoming the problems caused by material dispersion
which is present in all fibers.

FIG. 9.32 The irradiance profile of a first-order soliton, with a sech² time dependence.

9.4 Integrated optics



Although signal transmission using light waves is now well established, the optical signal 
usually has to be converted back into an electrical one if any processing needs to be done. 
The aim of integrated optics (IO) is to be able to carry out as much signal processing as
possible on the optical signal itself. It is envisaged that a family of optical and electro-optical
elements in thin film planar form will be used, allowing the assembly of a large number of 
such devices on a single substrate. Most device elements are expected to be based on single 
mode planar optical waveguides. Similar advantages are expected to those accruing when 
the idea of the integrated circuit was adopted in electronics. 

The basic concept of IO was first proposed by Anderson in 1965 (ref. 9.14) and considerable
progress has been made since then. (For a fairly comprehensive coverage of the field,
the reader may consult ref. 9.15.) Initially most effort was put into demonstrating the via- 
bility of a wide range of individual devices, whilst latterly interest has concentrated on the 
problem of device integration. One of the main difficulties has been that no one substrate 
is ideally suited for all the different types of device. Many of the earlier composite devices 
were based on hybrid structures (see e.g. Fig. 9.48) but recently more fully integrated 
structures have begun to emerge. 

9.4.1 Slab and stripe waveguides 

It is generally assumed that in IO the signal will be carried within planar waveguides in either
slab or stripe form which are formed by modifying the surface of a substrate. Planar
waveguides were discussed in section 8.2, but there we confined our attention to symmetrical
guides. In the present instance we are usually dealing with asymmetrical guides, that is guides
where the layers above and below the guiding layer have different refractive indices (Fig.
9.33). The topmost layer (of refractive index $n_0$) is often air and consequently has a much
lower refractive index than either the guide layer ($n_1$) or the substrate ($n_2$); such a guide
is thus referred to as a strongly asymmetric guide.

FIG. 9.33 Slab planar waveguide. The guide itself is formed on a substrate. The medium above the
guide is usually air. The refractive indices of substrate, guide and topmost layer are $n_2$,
$n_1$ and $n_0$ respectively.

It is a relatively simple matter to extend the treatment of symmetric guides given in section
8.2 to cover the strongly asymmetric case. We suppose that the phase changes at the upper and
lower interfaces are $\phi_1(\theta)$ and $\phi_2(\theta)$ respectively, and using similar
arguments to those that led up to eq. (8.9) the condition for a ray to be able to propagate
becomes

$$\frac{4\pi n_1 d\cos\theta}{\lambda_0} - \phi_1(\theta) - \phi_2(\theta) = 2m\pi$$

For a ray to undergo total internal reflection at both the upper and lower interfaces, the
internal guide angle must always be greater than either of the critical angles of the upper
and lower interfaces (denoted by $\theta_{c1}$ and $\theta_{c2}$ respectively). Because we have a
strongly asymmetric guide the critical angle at the upper interface will be much smaller than
that at the lower. Thus as far as guided rays are concerned, we can assume that the internal
guide angle will always be much larger than the critical angle of the upper interface, and
consequently we may approximate $\phi_1(\theta)$ by $\pi$. The propagation condition can now be
written

$$\frac{4\pi n_1 d\cos\theta}{\lambda_0} = \phi_2(\theta) + (2m+1)\pi \qquad (9.23)$$



Following a line of reasoning similar to that which led up to eq. (8.12), the condition for the
$m = 1$ mode to propagate becomes

$$\frac{4\pi n_1 d\cos\theta_{c2}}{\lambda_0} > 3\pi$$

In contrast to the symmetric waveguide case, it is now possible for no mode at all to propagate,
which happens when

$$\frac{4\pi n_1 d\cos\theta_{c2}}{\lambda_0} < \pi$$

Thus the condition for only a single mode to propagate now becomes

$$\pi < \frac{4\pi n_1 d\cos\theta_{c2}}{\lambda_0} < 3\pi$$

By putting $\cos\theta_{c2} = [1 - (n_2/n_1)^2]^{1/2}$ this condition can be written

$$\frac{\pi}{4} < V < \frac{3\pi}{4} \qquad (9.24)$$

where $V$ is as defined in eq. (8.14). In terms of the guide thickness $d$ we have

$$\frac{1}{4(\mathrm{NA})} < \frac{d}{\lambda_0} < \frac{3}{4(\mathrm{NA})} \qquad (9.25)$$

where $\mathrm{NA} = (n_1^2 - n_2^2)^{1/2}$.




This equation implies that, in contrast to the symmetrical guide case, there is a minimum
thickness below which it is not possible for the guide to support a single mode. A calculation
based on this equation is given in Example 9.3.

EXAMPLE 9.3 Guide thicknesses for strongly asymmetric waveguides

Planar waveguides may be made in LiNbO$_3$ by diffusing in titanium; a 1% concentration
of titanium causes (at a wavelength of 0.63 μm) a refractive index change of
about $6 \times 10^{-3}$. The ordinary refractive index of LiNbO$_3$ is 2.286, so we have $n_1 = 2.286$
and $n_2 = 2.280$ and then $\mathrm{NA} = (2.286^2 - 2.280^2)^{1/2} = 0.166$. Using eq. (9.25) the
requirement for single mode behaviour becomes

$$1.51 < d/\lambda_0 < 4.52$$

The exact value chosen for $d/\lambda_0$ depends on the manufacturing process and the use to
which the guide is being put. Assuming that it is desirable that the guide be as thick
as possible then a suitable design thickness for such a guide could be $d = 4\lambda_0$. It would
be unwise to approach the upper limit too closely lest the inevitable fluctuations in $d$
that will occur in any manufacturing process cause the guide to become multimode
in some places.
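
The single mode condition of eq. (9.25) is easily evaluated numerically. The short Python sketch
below computes the bounds on $d/\lambda_0$ for the guide indices quoted in Example 9.3 and counts
the guided modes at a given thickness. The function names are our own, chosen purely for
illustration, and the mode count rests on the assumption that the cut-off condition generalizes
from eq. (9.23) in the obvious way, so that mode $m$ propagates when $4(\mathrm{NA})d/\lambda_0 > 2m + 1$.

```python
import math


def numerical_aperture(n_guide, n_substrate):
    """NA = (n1^2 - n2^2)^(1/2) for guide index n1 and substrate index n2."""
    return math.sqrt(n_guide**2 - n_substrate**2)


def single_mode_range(n_guide, n_substrate):
    """Bounds on d/lambda_0 from eq. (9.25): 1/(4 NA) < d/lambda_0 < 3/(4 NA)."""
    na = numerical_aperture(n_guide, n_substrate)
    return 1.0 / (4.0 * na), 3.0 / (4.0 * na)


def guided_mode_count(d_over_lambda, n_guide, n_substrate):
    """Number of guided modes, assuming mode m needs 4 NA d/lambda_0 > 2m + 1."""
    x = 4.0 * numerical_aperture(n_guide, n_substrate) * d_over_lambda
    return max(0, math.ceil((x - 1.0) / 2.0))


if __name__ == "__main__":
    # Indices quoted in Example 9.3 for a Ti-diffused LiNbO3 guide
    lower, upper = single_mode_range(2.286, 2.280)
    print(f"single mode for {lower:.2f} < d/lambda_0 < {upper:.2f}")
    for ratio in (1.0, 4.0, 5.0):
        print(f"d/lambda_0 = {ratio}: {guided_mode_count(ratio, 2.286, 2.280)} mode(s)")
```

Running the script prints the two bounds and the number of modes supported at three trial
thicknesses, which gives a quick check on any proposed design value of $d$.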



The field distribution can also be derived using a similar approach to that used for the
symmetric guide, and is illustrated for a TE$_0$-type mode in Fig. 9.34. A cosinusoidal variation
is still obtained across the core with the peak being displaced towards the $n_2/n_1$ interface.
Similarly an exponential decline is observed in the cladding but this is now more rapid
in the upper medium ($n_0$) than in the lower ($n_2$).

The above analysis assumes an infinitely wide waveguide; in practice most waveguides
used in IO have an approximately rectangular cross-section so that there is confinement in
both the $x$ and $y$ directions. Some typical waveguide configurations are illustrated in Fig.
9.35. A further complication is that the waveguides usually have graded index profiles. There
are no general explicit solutions available for either of these situations. A very simplistic
view is to assume that, within the waveguide core, the modal field variation with $x$ and $y$
will be given by the product $\mathcal{E}_x(x)\,\mathcal{E}_y(y)$, where the field $\mathcal{E}_x(x)$
is the solution to the waveguide equation given by assuming the guide is confined in the $x$
direction but unconfined in the $y$ direction (and vice versa for $\mathcal{E}_y(y)$). Such solutions
can be useful as the starting point for more accurate 'perturbation'-type calculations, but with
the powerful computing facilities now widely available, the field distributions are more readily
determined from 'first-principle' calculations involving the point-to-point solution of Maxwell's
equations across the guide cross-section.

FIG. 9.34 The TE$_0$ field distribution within a strongly asymmetric planar waveguide.

FIG. 9.35 Two basic geometries used for making IO stripe waveguides: (a) the channel waveguide; (b) the
ridge waveguide.

It is possible to make a few simple generalizations based on our understanding to date.
We would expect that in single mode guides the dimensions in both the $x$ and $y$ directions
would be of the order of a few times the propagating wavelength (see Example 9.3), and
that the mode fields should peak within the core of the guides, declining towards the edges
of the core and having a quasi-exponential decline with distance away from the core in the
cladding regions.

The three main types of material which have been used as the basis of integrated optical
waveguides are various types of glass, materials with high electro-optic coefficients
such as lithium niobate, and semiconductor materials such as GaAs. A wide variety of different
techniques (see ref. 9.16) can be used in waveguide manufacture: examples include
sputtering one type of glass onto another, in-diffusion of a layer of titanium deposited on a
substrate of lithium niobate, and liquid phase epitaxy. This last technique can be used with
semiconducting materials such as GaAs and GaAlAs. The physical extent of the waveguide
can often be delineated using the same photo- or electron beam lithography techniques that
are common in the semiconductor integrated electronics industry. Losses in IO waveguides
are usually much higher than in optical fibers, being of the order of 0.1 dB mm$^{-1}$. One reason
for this is the large scattering from the upper waveguide surface (i.e. the waveguide/air
interface) which is often relatively rough. Another problem, which may hinder miniaturization,
is that any bends in the waveguides with small radii of curvature (i.e. less than a millimetre
or so) must be avoided, otherwise losses can become prohibitively large (see the discussion
in section 8.4.1).



9.4.2 Basic IO structural elements

We consider first of all one of the simplest passive devices, a waveguide splitter where an
initial single mode waveguide splits into two single mode waveguides of the same width
(Fig. 9.36). In the first section of this device the waveguide expands gradually to twice its
original width of $a$ and then the guide splits into two sections each of width $a$ and angled at
$\theta$ to the original direction. Energy that is in the initial section of the guide will divide itself
equally between the two output guides. The loss at the splitting point can be estimated by
calculating the overlap integral (eq. 9.9) of the mode field in the section of width $2a$ with
the fields in either of the two output sections which are angled at $\theta$ to it. An example of such
a calculation is given in ref. 9.17; the outcome is that to avoid excessive loss the angle
between output waveguides ($= 2\theta$) must be small (usually 1° or less). It should be noted that
in this and several subsequent diagrams involving waveguide splitters the angles are shown
as being much larger than 0.5°; this is merely for pictorial convenience.

One of the simplest active devices is a phase modulator which is similar to the Pockels
cell discussed in section 3.4. A stripe waveguide is formed within a suitable optically active
material such as lithium niobate and electrodes are formed on the substrate surface on either
side of the guide (Fig. 9.37). If the electrodes are a distance $D$ apart and extend for a length
$L$, then the additional phase shift $\Delta\phi$ produced when a voltage $V$ is applied across the
electrodes is, from eq. (3.14),

$$\Delta\phi = \frac{\pi}{\lambda_0}\, r n_1^3 V \frac{L}{D} \qquad (9.26)$$

where $r$ is the guide material electro-optic coefficient (see Table 3.1 for representative values)
and $n_1$ is the guide refractive index.



FIG. 9.36 (a) A waveguide 'Y' branch with an angle $2\theta$ between the guides. Before the actual branch
point there is a tapered section where the waveguide width is slowly increased from $a$ to $2a$. (b) The
efficiency of the splitting process as a function of $\theta$ may be calculated by determining the overlap
integral of the waveguide modes at points just before (A) and just after (B) the branch point.

FIG. 9.37 Integrated optical version of the Pockels cell that may be used as a phase shifter.

A big advantage over bulk Pockels effect devices is that the ratio $L/D$ may be made relatively
large, say about 1000, and a phase change of $\pi$ may then be achieved with voltages
as low as 1 V or so (see Problem 9.15).
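
Equation (9.26) may be inverted to give the voltage needed for a phase change of $\pi$, namely
$V_\pi = \lambda_0 D/(r n^3 L)$. The Python sketch below evaluates this for several values of
$L/D$. The figures $r = 30$ pm V$^{-1}$ and $n = 2.2$ are assumed round numbers of the order
appropriate to lithium niobate; they are not taken from Table 3.1, and the function names are
purely illustrative.

```python
import math


def phase_shift(voltage, wavelength, r, n, length_over_gap):
    """Additional phase shift (radians) according to eq. (9.26)."""
    return math.pi * r * n**3 * voltage * length_over_gap / wavelength


def half_wave_voltage(wavelength, r, n, length_over_gap):
    """Voltage giving a phase shift of pi, found by inverting eq. (9.26)."""
    return wavelength / (r * n**3 * length_over_gap)


if __name__ == "__main__":
    # Assumed illustrative values, not data from the text
    wavelength = 0.63e-6   # vacuum wavelength in metres
    r = 30e-12             # electro-optic coefficient in m/V
    n = 2.2                # guide refractive index
    for ratio in (10, 100, 1000):
        v_pi = half_wave_voltage(wavelength, r, n, ratio)
        print(f"L/D = {ratio:5d}: V_pi = {v_pi:.2f} V")
```

Because $V_\pi$ scales inversely with $L/D$, increasing the ratio from the values of order ten
typical of bulk cells to about 1000 reduces the required voltage by two orders of magnitude.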



A high speed switch/modulator may be made by incorporating the phase shifter into one
arm of the interferometric arrangement shown in Fig. 9.38. This configuration is known as
a Mach-Zehnder interferometer. In it the guide splits into two with both paths rejoining after
an identical path length. With no applied voltage across the phase shifter, the radiation in
the two arms will have the same phase when they recombine, and hence the device will not
affect the radiation flowing along the guide.³ However, if the phase shifter is activated to
give a phase shift of $\pi$, then, on recombining, the radiation in the two arms will destructively
interfere and no radiation will proceed down the guide (see Problem 9.15). It is easy to show
that the output of the device will have a sinusoidal dependence on the applied field (see
section 10.1.3.1 which deals with a very similar problem), and so if the device is biased at
the half maximum transmission point the output irradiance will vary approximately linearly
with applied field provided the field variations are small (see Fig. 3.10b for an illustration
of this). Devices are commercially available, based mainly on lithium niobate, which have
modulation capabilities of up to a few tens of gigahertz.
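
The sinusoidal dependence of the output on the applied field can be made explicit. For an ideal
interferometer, assumed here to be lossless and to split the power equally between the arms,
adding the two recombining fields gives a transmission $I_{out}/I_{in} = \cos^2(\Delta\phi/2)$,
where $\Delta\phi$ is the phase difference between the arms. The sketch below (with illustrative
function names of our own) evaluates this expression and compares it with its linear
approximation about the half maximum transmission point $\Delta\phi = \pi/2$.

```python
import math


def mz_transmission(delta_phi):
    """Ideal Mach-Zehnder transmission I_out/I_in = cos^2(delta_phi / 2)."""
    return math.cos(delta_phi / 2.0) ** 2


def linearized_transmission(epsilon):
    """First-order expansion about the bias point delta_phi = pi/2 + epsilon."""
    return 0.5 - 0.5 * epsilon


if __name__ == "__main__":
    for label, phi in (("no phase shift", 0.0), ("phase shift of pi", math.pi)):
        print(f"{label}: transmission = {mz_transmission(phi):.3f}")
    # Small excursions about the half maximum transmission point
    for epsilon in (-0.1, -0.05, 0.0, 0.05, 0.1):
        exact = mz_transmission(math.pi / 2.0 + epsilon)
        approx = linearized_transmission(epsilon)
        print(f"epsilon = {epsilon:+.2f}: exact = {exact:.5f}, linear = {approx:.5f}")
```

The difference between the exact and linear values grows as the cube of the excursion, which
is why the modulation must be kept small if a linear response is required.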



FIG. 9.38 Interferometric modulator; a voltage applied across the electrodes affects the refractive index of
the upper guide over a distance $L$.

Some work has also been carried out into the possibility of using organic polymers in
phase shift modulators. These materials offer the advantage of low cost and flexible processing
techniques coupled with electro-optic coefficients which are as large as, and possibly
larger than, those of lithium niobate. However, there are problems with high insertion losses
(when used in conjunction with silica optical fibers) and the low thermal stability of the
material. Semiconductor materials such as GaAs and InP do exhibit an electro-optic effect,
but the electro-optic coefficients are not very large (see Table 3.1). However, semiconductor
quantum well structures exhibit an electric-field-induced change in their effective refractive
index which is much larger than in the bulk material.⁴ Although only some 10% of the
mode field in such waveguides is actually within the quantum well structure, which reduces
the effectiveness of the refractive index changes, the changes are still appreciably larger
than in the bulk material. The interaction length required (i.e. the length $L$ in Fig. 9.38)
is thereby reduced from a few millimetres to several hundred micrometres. This has two
main advantages: first it increases the device density on the substrate, thereby reducing costs;
and secondly it reduces the electrical capacitance, thereby increasing the modulation bandwidth.

A different type of switch/modulator may be constructed which utilizes the coupling of
energy between waveguides when they are brought into close proximity. We have met this
phenomenon before when considering optical fiber couplers (section 8.5.2); it arises as a
result of the overlap of the evanescent field in one guide with the core of the neighbouring
guide. Using coupled mode theory it can be shown (ref. 8.28) that the amount of energy
coupled into the neighbouring guide over a length $z$ (Fig. 9.39) is proportional to the factor
$F_c$ where, for identical waveguides,

$$F_c = \sin^2(Cz) \qquad (9.27)$$

whilst for non-identical waveguides

$$F_c = \frac{C^2}{C^2 + \delta^2}\,\sin^2[z(C^2 + \delta^2)^{1/2}] \qquad (9.27a)$$

Here, $C$ characterizes the coupling between the guides and the factor $\delta$ is given by

$$\delta = \frac{\pi\nu}{c}(n_1 - n_2)$$

where $n_1$ and $n_2$ are the refractive indices of the two guide materials, $\nu$ is the frequency
of the radiation and $c$ is the velocity of light in vacuum.

FIG. 9.39 Two waveguides in close proximity over a coupling distance $z$. As explained in the text,
evanescent field overlap can cause energy to be transferred between the guides.

If $\delta = 0$, then after a coupling length $z = L_c$, where $L_c = \pi/(2C)$, all the power in one guide
will be transferred to the other, whilst after a further distance $L_c$ all the power will have
been transferred back again (Fig. 9.40a). However, if $\delta \neq 0$ then not only is less energy
exchanged but also the energy exchange takes place more rapidly with distance (Fig. 9.40b).
Suppose that we have identical waveguides which can be changed to non-identical guides
in some way. If the coupling length is equal to $L_c$ then, whilst the guides are identical ($\delta = 0$),
all the energy in one guide will couple into the other. However, if the guides are now made
non-identical and with $\delta$ having a value such that $L_c(C^2 + \delta^2)^{1/2} = \pi$ (i.e. $\delta = \sqrt{3}\,C$), then over
a coupling length $L_c$ no energy is exchanged. This means we can switch energy from one
waveguide to another provided we can 'switch' from a situation where $\delta = 0$ to one where
$\delta = \sqrt{3}\,C$ (see Problem 9.16).
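
The switching argument can be checked numerically from eqs (9.27) and (9.27a). The sketch
below uses an arbitrary coupling coefficient (an assumed value, chosen only for illustration)
and function names of our own; it evaluates the coupled fraction at the coupling length $L_c$
for $\delta = 0$ and for $\delta = \sqrt{3}\,C$.

```python
import math


def coupling_length(C):
    """Length L_c = pi / (2C) for complete transfer between identical guides."""
    return math.pi / (2.0 * C)


def coupled_fraction(z, C, delta=0.0):
    """Factor F_c of eq. (9.27a); reduces to eq. (9.27) when delta = 0."""
    s = math.sqrt(C * C + delta * delta)
    return (C * C) / (s * s) * math.sin(z * s) ** 2


if __name__ == "__main__":
    C = 2.0e3                      # assumed coupling coefficient in m^-1
    L_c = coupling_length(C)
    delta_off = math.sqrt(3.0) * C
    print(f"L_c = {L_c * 1e3:.3f} mm")
    print(f"delta = 0:         F_c(L_c) = {coupled_fraction(L_c, C):.3f}")
    print(f"delta = sqrt(3) C: F_c(L_c) = {coupled_fraction(L_c, C, delta_off):.3f}")
    # Largest fraction that can ever be exchanged in the mismatched case
    print(f"maximum exchange when mismatched = {C**2 / (C**2 + delta_off**2):.2f}")
```

With $\delta = \sqrt{3}\,C$ the prefactor $C^2/(C^2 + \delta^2)$ limits the exchange to one-quarter
of the power at most, and the argument of the sine reaches $\pi$ at $z = L_c$, so that no net
transfer has taken place at the end of the coupling region.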



FIG. 9.40 Transfer of optical power between waveguides as a function of coupling distance $z$ (see Fig.
9.39) according to eqs (9.27) and (9.27a). In (a) the guides are identical and all the energy is exchanged
between the guides, whereas in (b) the guides are non-identical and only a partial exchange of energy is
possible.

Several configurations have been built to implement this, of which the simplest is shown
in Fig. 9.41. Electrodes are deposited above each waveguide and a potential is applied
between them. The opposing vertical fields in the two waveguides can, if the material axes
have been chosen correctly, induce opposite changes in the guide refractive indices and hence
change the value of $\delta$. The main problem with this arrangement is that it is not easy to achieve
total energy transfer because of the difficulty of ensuring that the coupling length is exactly
$L_c$. More complicated electrode configurations have been proposed to overcome these
difficulties (ref. 9.18).

Devices such as filters and resonators may be realized in IO by incorporating periodic structures
into optical waveguides. Consider, for example, a waveguide with a 'corrugation' etched
upon its surface perpendicular to the direction of beam propagation (Fig. 9.42). This structure
is encountered in the distributed feedback laser (section 6.2); it acts as a wavelength-dependent
mirror, that is strong reflection occurs when $2D = m\lambda_0/n_1$, where $D$ is the grating period,
$\lambda_0$ the vacuum wavelength, $n_1$ the guide material refractive index and $m$ an integer. Reflection
bandwidths are usually narrow, but may be increased by 'chirping' the grating (Fig. 9.43).
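
The reflection condition fixes the grating period once the wavelength and guide index are
known. As a rough illustration, the sketch below evaluates $D = m\lambda_0/(2n_1)$; the
wavelength of 1.55 μm and the index of 2.2 are assumed values chosen for the example, and
the function names are our own.

```python
def bragg_period(vacuum_wavelength, n_guide, order=1):
    """Grating period D satisfying 2D = m * lambda_0 / n_1."""
    return order * vacuum_wavelength / (2.0 * n_guide)


def reflected_wavelength(period, n_guide, order=1):
    """Vacuum wavelength strongly reflected by a grating of period D in order m."""
    return 2.0 * period * n_guide / order


if __name__ == "__main__":
    # Assumed illustrative values: 1.55 um radiation in a guide of index 2.2
    wavelength = 1.55e-6
    n_guide = 2.2
    for m in (1, 2, 3):
        period = bragg_period(wavelength, n_guide, order=m)
        print(f"m = {m}: D = {period * 1e6:.3f} um")
```

For first-order operation the period is a fraction of a micrometre, which indicates the
lithographic resolution needed to fabricate such gratings.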

Although all the devices discussed above use stripe waveguides, devices can also be based 
on slab waveguides. One of these is a beam deflector based on diffraction from an acoustic 
wave. An interdigital electrode structure deposited on a suitable acousto-optical material (see 
Fig. 9.44) can generate a beam of surface acoustic waves which can then serve to diffract 
light travelling along the guide. The angle through which the beam is diffracted may be 
changed by varying the frequency (and hence the wavelength) of the acoustic wave. This 
device is of course a version of the acousto-optic deflector discussed in Chapter 3. 



FIG. 9.41 An electrode configuration used to modify the propagation conditions in two adjacent
waveguides in order to alter the coupling of radiation between them. The electric fields are in opposite
directions through the guides and hence the effective refractive index of one guide will be raised whilst
that of the other will be lowered.

FIG. 9.42 Waveguide with a corrugation of period $D$ etched upon it.

FIG. 9.43 'Chirped' diffraction grating structure (decreasing grating spacing). A plan view is shown with
the vertical lines representing the grating peaks (or troughs).

FIG. 9.44 Beam deflection using diffraction from a surface acoustic wave generated by applying an
alternating voltage to an interdigital structure evaporated onto the surface of a piezoelectric substrate.



When it comes to emitters and detectors the most obvious choice for a substrate would
seem to be a semiconductor. We have seen in Chapters 6 and 7 how efficient emitters and
detectors can be made from them. However, many of the modulators/switches discussed
above were based, not surprisingly, on materials such as lithium niobate, which exhibit
relatively high electro-optic coefficients. Semiconductor materials such as GaAs and InP are
electro-optically active but have appreciably smaller coefficients, as has already been
pointed out. Thus the optimum substrate materials for modulators and emitters/detectors
do not necessarily coincide. Since it is impossible for lithium-niobate-type materials to be
made into emitters or detectors, it would appear that, if complete integration is required,
semiconductor substrates will have to be used, probably in conjunction with quantum well
structure modulators which offer an increased electric field sensitivity.

In semiconductor lasers the radiation is generated within a channel of similar dimensions
to those of stripe waveguides, which aids the coupling of power between them. Because of
the difficulties of obtaining cleaved end mirrors, however, a distributed feedback structure
(section 6.2) is usually employed, rather than the more common Fabry-Perot structure. One
problem with the laser as a source is that, when not being pumped, the lasing region is absorbing;
consequently arrangements are usually made for coupling the radiation from the active
layer into a non-lossy guiding layer situated beneath it (see Fig. 9.45). Quantum well structures
are attractive for lasers since they offer a number of advantages such as low threshold
current, low temperature sensitivity and excellent dynamic behaviour as well as integrating
well with modulators which are also based on quantum wells.

FIG. 9.45 IO semiconductor laser based on GaAs/GaAlAs using Bragg reflectors instead of cleaved end
mirrors. Light from the active layer is coupled into the layer beneath, which then acts as a waveguide.

FIG. 9.46 (a) The basic structure of an R-SEED switching device. (b) The relationship between the incident
($I_{in}$) and transmitted ($I_{out}$) irradiances. As explained in the text the device exhibits a hysteresis loop.

Finally in this section we deal with a class of devices known as 'bistable optical devices'.
In these devices two distinct optical states can exist with the possibility of switching between
them. Such basic devices can form the basis of a whole series of logic gates (e.g. AND,
NAND, OR and NOR gates). Furthermore if the elements can be switched by light itself
then we have the basis of an all-optical computer (ref. 9.19). Such a switching device can
be made by growing a multiple quantum well in place of the intrinsic region in a p-i-n
photodiode (Fig. 9.46a). We suppose that light of irradiance $I_{in}$ falls onto one face of the
device and that an irradiance $I_{out}$ emerges from the other side. The device is electrically
biased using a battery and a load resistor $R$ (and hence is known as an R-SEED). Because
of the presence of the bias field the absorbance of the MQW structure is relatively small
and thus for small values of $I_{in}$ we will have that $I_{out} \approx I_{in}$. If $I_{in}$ is now increased then so
will the current flowing in the external bias circuit. As this current becomes larger so the
bias actually applied to the device will fall and hence cause the absorption in the MQW
region to increase (section 3.9). A point will be reached when the device suddenly switches
to a state where the absorption in the MQW region is very high; a relatively large current
flows in the bias circuit and the voltage across the cell is very small (which is what causes
the absorption to be high). If now $I_{in}$ is slowly reduced the high absorption state will persist
for some time before the device equally suddenly switches back to the low absorption state
as illustrated in Fig. 9.46(b); thus the switching process exhibits hysteresis. Suppose we
have a bias beam of irradiance $I_b$ where $I_b$ is such as to be just less than the irradiance at
which the device switches from the high to the low transmission state and that, in addition, we
have a signal beam $I_s$ where the combined irradiance $I_b + I_s$ is such as to cause the device
to switch just to the low transmission state. If the device is then permanently illuminated
with the bias beam, the presence or absence of the signal beam will cause the device to
switch between low and high transmission states. Unfortunately it is not usually practicable
to maintain the constancy of the bias beam with the required accuracy. This problem
can be solved by replacing the bias resistor with another SEED device, thus generating the
symmetric SEED or S-SEED (Fig. 9.47). The most important feature of the S-SEED is that
changes of state occur if the ratio of the two input power irradiances alters; changes in the
absolute irradiances of the input beams do not cause a change of state. The output of the
device consists of a pair of light beams, one of which will be in the 'on' or high irradiance
state, the other in the 'off' or low irradiance state. In single rail logic the signal beam is
only incident on one of the devices, but other pulses must be applied to both elements.
Suppose, for example, that a logical AND operation is required; the sequence of operation
is as follows:

1. Initial 'preset' pulses are applied so that device 1 is set at high ($D_1 = 1$) and device 2 at
low ($D_2 = 0$).

2. The input signal pulses, $S_a$ and $S_b$, are applied to device 1.

3. 'Clock' pulses are applied to both devices: the result of a logical AND operation may then
be read from the output of device 2 and of a logical NAND from the output of device 1.

Suppose, for example, that $S_a = 0$ and $S_b = 1$; after $S_a$ there will be no change in the state of
the detectors, but after $S_b$ the detectors' states will switch, so that $D_1 = 0$ and $D_2 = 1$. The clock
pulse will then read a logical 1 from device 2 as required (and a logical 0 will be read from
device 1, giving a NAND operation). The logical operations OR and NOR can similarly be
obtained by initially setting $D_1 = 0$ and $D_2 = 1$. By combining S-SEED devices more complex
devices can be constructed, for example a 2 × 2 switching element can be constructed by using
six S-SEED elements (ref. 9.20).

FIG. 9.47 Two SEED devices ($D_1$ and $D_2$) connected together to form an S-SEED.



9.4.3 IO devices

One of the earliest IO devices to be fabricated was an optical spectrum analyzer, designed
to display the frequency spectrum of a radio-frequency (RF) signal (ref. 9.21). The layout
is illustrated in Fig. 9.48. Light from a semiconductor laser is launched into a waveguide
and the beam subsequently rendered parallel by the use of a 'geodesic' lens. This is a circular
indentation made in the substrate layer with the guide layer thickness being unchanged
across the indentation (Fig. 9.49). Such a structure behaves like a 'one-dimensional' lens
(ref. 9.22). The parallel beam then passes through the acousto-optic beam deflector. If a single
RF frequency is present, the amount of deflection will depend on the instantaneous value
of the frequency. A second lens subsequently focuses the light onto a particular photodetector
in a photodetector array. Each detector element corresponds to a particular frequency
(or, more accurately, a narrow range of frequencies). If, in fact, more than one frequency is
present, the light will be divided into different components that are then focused onto different
elements, thus enabling the frequency spectrum of the RF signal to be obtained. The
device as illustrated here is obviously a 'hybrid' since the emitters and detector elements
are separate devices which are merely attached to the edge of the substrate.

FIG. 9.48 Integrated optical spectrum analyzer based on an acousto-optic deflector.

FIG. 9.49 Cross-section of a geodesic lens in a waveguide.

Some more recent integrated devices have involved the integration of electronic components
(usually gain devices based on transistors) with optical elements. For example a Pockels
effect-type modulator can be driven from the output of a transistor amplifier so that the actual
electrical input signal to the device can be relatively small. Other candidates are the drive
circuits for laser emitters and the front end amplifier for an optical detector. As an example of
the latter, Fig. 9.50 shows a (simplified) schematic diagram of an MSM photodiode (section
7.3.6.5) integrated with a MESFET (Metal-Semiconductor Field Effect Transistor) device
grown on a GaAs substrate. The GaAs MESFET is somewhat similar to the silicon-based JFET;
it differs from the latter in that the gate is not a p-n junction but a Schottky barrier metal contact.
When an external potential is connected between the source and drain contacts a current will
flow between them through the n-type GaAs layer. If a negative voltage is applied to the gate
electrode the reverse bias creates a depletion region in the n-type channel under the gate which
then modulates the source-drain current. Such devices are capable of operating with modulation
frequencies of up to 20 GHz. Unfortunately the reliance on Schottky barriers makes such
devices impractical for use with narrow bandgap semiconductor materials such as InGaAs/InP,
since then leakage currents become excessive.

FIG. 9.50 The combination of an optical detector (MSM) and amplifier (MESFET) grown together on a
single GaAs substrate.

Figure 9.51 illustrates an integrated laser emitter-external modulator combination for use
at 1.55 μm which has been demonstrated by AT&T Laboratories. The laser, modulator and
connecting waveguide are based on quantum well structures and the laser has an external
Bragg reflector for feedback. Within the laser itself the eight quantum wells are based on
compressively strained layers of InGaAs/InGaAsP, whereas within the modulator there are
10 layers based on InGaAsP/InP. The whole is based on an InP substrate. Other examples
of communication-orientated IO devices that have been successfully demonstrated are
a balanced heterodyne receiver and a tunable transmitter for wavelength division
multiplexing.

FIG. 9.51 An integrated optical emitter-modulator combination based on a quantum well semiconductor
laser and a Mach-Zehnder-type modulator.

Recently integrated optical devices based on the materials Si, Si$_{1-x}$Ge$_x$ and SiO$_2$ have
become commercially available (ref. 9.23). Silicon is reasonably transparent over the important
communication wavelength range 1.3 μm to 1.6 μm and waveguides can be made by
having silicon ($n = 3.5$) as the core material with SiO$_2$ ($n = 1.5$) as a substrate material. The
large refractive index difference between core and substrate means that single mode planar
waveguides will require very small silicon core regions, but by adopting the ridge waveguide
structure as shown in Fig. 9.52 mode field dimensions can be made similar to those
in single mode silica fibers. As a material silicon has the advantage of a mature processing
technology; for example, the ability to etch V-grooves in silicon (Fig. 8.36) can be put to
good use in the alignment of fibers with the waveguides. As has been pointed out previously
(section 4.6.1.1) it is not possible to make efficient emitters based on silicon, although various
types of modulator are possible, for example Bragg reflector types (ref. 9.24). Detectors which
make use of interband transitions in Si$_{1-x}$Ge$_x$ are also possible, although the fact that the
material is an indirect bandgap semiconductor means that relatively long lengths of detector
material are needed to absorb appreciable amounts of radiation. Available commercial
devices often use hybrid emitters and detectors butt coupled to the waveguides.

FIG. 9.52 Structure of a ridge waveguide using silicon as the guide layer.

By no means are all applications of IO communication based: for example, there are IO
versions of the active matrix LCD display (Fig. 4.31) and the read head of the compact disc
player. However, the number and complexity of possible IO devices is much too great to
be adequately dealt with here and the interested reader is directed to ref. 9.25 for further
information.

1. The absorption spectrum is shifted somewhat with respect to the fluorescence spectrum
and pumping at 1480 nm is possible.

2. A soliton was in fact first observed in 1834 in the form of a water wave on a canal.

3. In fact there is a loss of radiation, although to simplify the discussion we may ignore it,
see ref. 9.15a.

4. The refractive index changes result from changes in the absorption coefficient when a
field is applied, see section 3.9.



PROBLEMS 



9.1 How accurately may the amplitude of a signal waveform be specified using an 8 bit
number?

9.2 The analog modulation bandwidth required for television is about 4 MHz; what is the
minimum bit rate required for a digital transmission?

9.3 Estimate the maximum range for a line-of-sight optical communication system using
mirror collimators of 0.1 m diameter and operated at 0.85 μm. The minimum power
required by the detector is $10^{-9}$ W and the emitter launches 1 mW of radiation. Take
into account losses arising from both (a) beam diffraction and (b) atmospheric attenuation
(which you may assume to be 10 dB km$^{-1}$).

9.4 An LED source emits 1 mW from an area 100 μm in diameter. It is butt joined to a step
index fiber with a core diameter of 50 μm and core refractive index of 1.48. The cladding
has a refractive index of 1.46. Assuming the LED source is a Lambertian emitter, estimate
the energy coupled into the fiber.

9.5 The much narrower beam divergence of lasers enables them to couple substantially more
power into a fiber than LEDs. A convenient expression for their surface luminance as
a function of angle is given by $B(\theta) = B(0)\cos^n\theta$. A Lambertian source is characterized
by $n = 1$, whereas lasers may have $n \approx 20$ or more. Using this expression, calculate the
coupling efficiency into a fiber as a function of $n$ assuming $\mathrm{NA} \ll 1$.

9.6 By assuming that the field profile in single mode fibers is Gaussian (i.e.
$\mathcal{E}_0(r) = \mathcal{E}_0(0)\exp[-(r/\omega_0)^2]$), show that if a Gaussian beam is focused onto the end of
the fiber so that its field profile is $\mathcal{E}_B(r) = \mathcal{E}_B(0)\exp[-(r/\omega_B)^2]$ then, ignoring any Fresnel
reflection losses, the coupling efficiency, $\eta_c$, may be written

$$\eta_c = \frac{4\omega_0^2\omega_B^2}{(\omega_0^2 + \omega_B^2)^2}$$

(Hint: write eq. 9.9 in terms of radial coordinates before evaluating the overlap integral.)

9.7 By assuming that the mode field within a single mode fiber is Gaussian (so that
$\mathcal{E}(r) = \mathcal{E}(0)\exp[-(r/\omega_0)^2]$) and by ignoring any Fresnel reflection
effects, show that the coupling efficiency between two such fibers as a function of a
separation $z$ can be written as

$$\eta_c = \frac{4(1 + x^2)}{(2 + x^2)^2}$$

where $x = z\lambda_0/(\pi\omega_0^2)$. (Hint: use eq. 9.10 in conjunction with eq. 5.34.)

Sketch a graph of the variation of $\eta_c$ with $x$.

In a demountable join involving single mode fiber, the fibers have a separation
between their end faces of 10 µm. If the fibers involved have core diameters of 8 µm,
numerical apertures of 0.10 and the wavelength of radiation involved is 1.3 µm,
estimate the resulting coupling loss.

9.8 An emitter can couple some $10^{-3}$ W of optical radiation at 900 nm into a fiber which
has an attenuation of 5 dB km$^{-1}$. A receiver is used which requires an average of 500
photons per bit for an acceptable bit-error rate. Plot a graph of the maximum fiber length
possible for bit rates of between $10^5$ and $10^9$ bps. Modify this diagram to include the
effects of fiber dispersion (5 ns km$^{-1}$) if the maximum allowable pulse spreading must
not exceed 1/(2 × bit rate).

9.9 A photodiode is used in the photoconductive mode in conjunction with a load resistor
of 50 Ω. Optical power of 6 µW at a wavelength of 800 nm falls onto the detector. The
light is amplitude modulated at frequencies up to 100 MHz. Assuming the detector has
a quantum efficiency of 0.7 and a reverse bias saturation current of 2 nA, calculate
the resulting signal-to-noise ratio.

If the photodiode has a capacitance of 5 pF, determine whether the choice of a 50 Ω
load resistor is optimum from a signal-to-noise point of view. If it is not, determine the
optimum value and the corresponding signal-to-noise value.

9.10 Show that the optimum gain of an avalanche photodiode may be obtained by solving
the equation

$$M^3\,\frac{\mathrm{d}F(M)}{\mathrm{d}M} = \frac{2 \times \text{Johnson noise}}{\text{shot noise}}$$

where F(M) is the excess noise factor. Hence, by substituting for F(M) from eq. (7.38a)
and for the Johnson and shot noise terms from eq. (9.11) and eq. (9.12) respectively,
show that the optimum signal-to-noise ratio may be obtained by solving

9.11 Determine the optimum gain for an avalanche photodiode made from GaAs where the
electron-to-hole ionization ratio, $r$, is 5. Assume the ratio of amplifier to shot noise is
1000. Estimate the improvement in signal-to-noise ratio obtained in this instance by
using an APD rather than a p-i-n photodiode.

Repeat the calculation assuming an avalanche photodiode made from germanium
where $r = 1$.

9.12 Assuming that the noise in a detector is governed by statistical fluctuations in the photon
arrival rate, calculate the minimum signal power level at a wavelength of 1.3 µm
required to achieve a BER of $10^{-12}$ when the bit rate is 5 Gbps.

9.13 Show that the magnitude of the output signal voltage of the transimpedance amplifier
of Fig. 9.21 can be written

$$V_\mathrm{out} = \frac{i_\lambda R_f}{(1 + 4\pi^2 f^2 C_d^2 R_f^2/G^2)^{1/2}}$$

where G is the gain of the operational amplifier. Assume that the value of G is large
and remains independent of frequency.

9.14 It is required to construct an optical fiber communication link 5 km long with an analog
modulation bandwidth of 10 MHz. The proposed system uses multimode step index
fiber with core and cladding refractive indices of 1.48 and 1.46 respectively. At the
operating wavelength the fiber exhibits an attenuation of 3 dB km$^{-1}$. The emitter is an
LED which can launch 200 mW of optical power into the fiber and the detector a p-i-n
photodiode which requires -40 dBm to achieve the required signal-to-noise ratio. No
splices are required. The detector and emitter have rise times of 100 ns and 10 ns
respectively. Determine whether the link should operate satisfactorily and if not discuss
the component changes you would recommend.

9.15 A waveguide modulator of the Mach-Zehnder type for digital modulation is to be made
based on lithium niobate as a substrate material for use with radiation where $\lambda_0$ = 1.3 µm.
The guides themselves are 2 µm wide and the voltage to be applied is 5 V. Estimate
the length of waveguide over which the voltage must be applied. Assume that the
transverse Pockels effect coefficient in lithium niobate has the value
$30 \times 10^{-12}$ m V$^{-1}$, and that the ordinary refractive index is 2.2.

9.16 A modulator is to be made based on the coupling of radiation between two closely
spaced identical waveguides. With no voltage applied the coupling length for
$\lambda_0$ = 0.8 µm is 10 mm, and this is the actual length of the coupling region in the device.
In this condition radiation launched into one guide will be wholly coupled into the other
guide at the output. Voltages are applied across the waveguides over this length so that
the transverse fields produced within the guides act oppositely to each other. If the
waveguides are 4 µm wide, determine the approximate magnitude of the voltage which
will cause the radiation to switch back to the original guide at the output. Assume that
the transverse Pockels effect coefficient in lithium niobate has the value
$30 \times 10^{-12}$ m V$^{-1}$, and that the ordinary refractive index is 2.2.

CHAPTER 10

Non-communications applications of fibers



Although the main driving force in the development of optical fibers has been their use for 
telecommunications purposes, other, perhaps more mundane, uses promise to be of equal, 
if not greater, importance. In this chapter we examine some of these. 



10.1 Optical fiber sensors

In many industrial processes there is a need to monitor such quantities as displacement, pres- 
sure, temperature, flow rate, liquid level and chemical composition. Ideally, the measure- 
ment technique should be reliable, robust, corrosion resistant, intrinsically safe and free from 
external interference. Obviously, optical fibers and indeed optical methods in general have 
the potential for making a significant contribution in this area. In many cases, however,
well-developed alternative technologies already exist and it may be some time before optical fibers
can make any really significant penetration into the market. 

Optical fiber sensors themselves can be divided into two main categories, namely 'intrinsic' 
and 'extrinsic'. In the latter, the modulation resulting from a change in the measurand takes 
place outside the optical fiber, which acts merely as a convenient transmission channel for 
the radiation. In intrinsic sensors, on the other hand, the quantity to be measured acts directly 
on the fiber itself to modify the radiation passing down the fiber. 

As far as intrinsic sensors are concerned, it is possible to modulate the amplitude, the phase 
or the state of polarization of the radiation in the fiber. In multimode fibers, however, mode 
coupling and the usually random relationships between the phases and polarization states 
of the propagating modes generally preclude the use of either phase or polarization modu- 
lation. Thus, intrinsic multimode fiber sensors almost invariably involve amplitude modu- 
lation. On the other hand, with single mode fibers both phase and polarization modulation 
become possible. Phase modulation opens up the possibility of fiber-based interferometric 
sensors that can offer exceptionally high sensitivities. It is convenient therefore to treat mul- 
timode and single mode intrinsic fiber sensors separately. 

10.1.1 Multimode extrinsic optical fiber sensors

Some of the simplest extrinsic fiber-based sensors are concerned with the measurement of 
movement or position. For example, when two fiber ends are moved out of alignment, the 
coupling loss depends on the displacement. The actual relationships between loss and
displacement were considered in section 8.5 (see also Problem 8.17). A similar type of sensor
uses a shutter moving between two fiber ends that are laterally displaced (Fig. 10.1a).
Improvements in sensitivity are possible by placing a pair of gratings within the gap, one
fixed, the other movable (Fig. 10.1b). Here, however, although the sensitivity has increased,
the range has decreased, since the output will be periodic in the spacing of the grating. The
range of movement possible for the single shutter sensor is obviously limited by the fiber
core diameter. If a beam expander is employed between the fibers (Fig. 10.1c), then the range
can be greatly expanded. The displacement-transmission characteristic of the expanded beam
system is considered in Problem 10.1.

One of the first commercially available displacement sensors was the 'Fotonic sensor'.
This uses a bundle of fibers, half of which are connected to a source of radiation, the other
half to a detector (Fig. 10.2a). If the bundle is placed in close proximity to a reflecting
surface then light will be reflected back from the illuminating fibers into the detecting
fibers. The amount detected will depend on the distance from the fiber ends to the surface.
To analyze this dependence, we consider the somewhat simpler situation where there are
just two fibers. If we regard the reflecting surface as a mirror, the problem then reduces
to that of the coupling between two fibers that are displaced both laterally and
longitudinally (Fig. 10.2b). The form of the relationship between displacement and light output
may be determined by considering the overlap between the sensing fiber core area and the
cross-section of the light cone emitted by the image of the emitting fiber. We can readily
appreciate that at very small fiber-surface distances, no light will be coupled between the
two fibers. Then, beyond a certain critical distance, there will be an increasing overlap
between the above areas and the coupled radiation will increase rapidly. Once the
detecting fiber area is completely filled, however, the output will fall with increasing distance.
At large distances, an inverse square law will then be obtained. The displacement-output
characteristic is considered in more detail in Problem 10.2, and a typical result of this
analysis is shown in Fig. 10.2(c).

FIG. 10.1 Simple displacement sensors. In (a) a movable shutter varies the light coupled between two
longitudinally displaced fibers. In (b), the use of two gratings increases the sensitivity. In (c), a beam
expansion system enables the range of measurable displacements to be increased.

FIG. 10.2 Illustration of the Fotonic sensor. The general layout using fiber bundles is shown in (a). A
two-fiber version is shown in (b), which can be used to derive the form of the output-distance (d) relationship.
The typical result of such a calculation taking a = 100 µm, s = 100 µm and NA = 0.4 is shown in (c).

In practice, when a fiber bundle is used instead of just two fibers the displacement-output
characteristic will be somewhat different and will depend on how the emitting and receiving
fibers are distributed (usually randomly), but the overall shape remains similar to that of Fig.
10.2(c). Because of the very non-linear nature of the curve, the sensor is not very suitable for
the measurement of large displacements, although it is possible to increase the range by using
a lens system. In fact, the sensor was developed originally for non-contact vibration analysis.

Any displacement measurement technique is readily adapted to the measurement of
pressure, and those mentioned above are no exception. For example, the Fotonic sensor may be
placed close to a reflecting diaphragm with a constant pressure maintained on the sensor side.
Any change in external pressure will cause flexing of the diaphragm and a consequent change
in the instrument's output. It should be remembered, however, that none of the instruments
described above is linear except over a very limited range of displacements. Accurate
calibration over the whole range is therefore required.

As well as displacement/pressure sensors, a number of extrinsic fiber temperature sensors
have been proposed. For example, the bandgap of semiconductors such as GaAs is
temperature dependent (Fig. 10.3a) and a simple sensor can be made in which a piece of the
semiconductor is placed in the gap between the ends of two fibers (Fig. 10.3b). Light with
a wavelength corresponding approximately to the semiconductor bandgap is sent down one
of the fibers and the power emerging from the other is measured and can be related to the
temperature.

FIG. 10.3 (a) Schematic variation of the absorption coefficient (α) of GaAs with both wavelength (λ) and
temperature. (b) Temperature sensor utilizing variations in the transmission of GaAs with temperature.

Another temperature sensor, the Fluoroptic sensor, is available commercially and is
claimed to have a sensitivity of 0.1 °C over the range -50 °C to 250 °C. The instrument relies
on the temperature variation of the fluorescence in europium-doped lanthanum oxysulfide
(Eu:La$_2$O$_2$S). A small amount of this material is placed on the object whose temperature
is to be measured, and the fluorescence is excited by illuminating it with ultraviolet light
transmitted down a fairly large diameter (400 µm) plastic-coated silica fiber. The source
of radiation is a quartz-halogen lamp whose output has been filtered to remove any
unwanted higher wavelengths. Another, similar, fiber picks up some of the emitted
fluorescence and carries it back to the detector system (Fig. 10.4). In fact, the phosphor
emits at more than one wavelength and it is the intensity ratio of two of the lines which is
measured. Because a ratio is measured, any fluctuations in the irradiance of the source are
not important.

At the detector end of the fiber, the radiation is split into two using a beam splitter. Each
beam falls onto a silicon photodiode having an appropriate filter in front to isolate the
particular wavelength required. The ratio of the two signals then provides the temperature
information, which is usually contained in a 'look-up' table. Because the ultraviolet output from
a quartz-halogen lamp is small and the fiber absorption relatively large at short wavelengths,
the output of the phosphor is quite small. Efficient detectors and low noise preamplifiers
are required, and the maximum fiber length is restricted to about 15 m. Nevertheless, the
device provides a performance superior to thermocouples, and allows point temperature
measurements in semiremote hostile environments.

FIG. 10.4 Schematic layout of the Fluoroptic temperature sensor. The fluorescent radiation generated in
the phosphor is separated into its two main constituent wavelengths ($\lambda_1$ and $\lambda_2$) and the relative optical
power of these wavelengths is determined by using a beam splitter followed by two optical bandpass filters
to isolate the two wavelengths.

10.1.2 Multimode intrinsic optical fiber sensors

One of the ways in which we can influence the amount of radiation flowing down a fiber is
by means of microbending loss (section 8.4.1), and this can therefore be made the basis of
a displacement or pressure transducer. In a typical device, the fiber passes between a pair
of ridged plates which impart a periodic perturbation to the fiber. In fact, we have met such
an arrangement before in the guise of a mode scrambler (section 8.6). If step index fiber is
used, a particular periodic perturbation of wavelength $\Lambda$ will only couple together a few
modes (see eq. 8.30). However, it may be shown (ref. 10.1) that, with a graded index fiber
where the profile parameter, $\alpha$, is equal to 2, all modes are coupled together when

$$\Lambda = \Lambda_c = \frac{2\pi a}{(2\Delta)^{1/2}} \qquad (10.1)$$

Typically $\Lambda_c \approx 1$ mm (see Example 10.1), and with this value for $\Lambda$ we expect very high
microbending loss and hence high sensitivity for the sensor.

EXAMPLE 10.1 Microbending sensor periodicity

We consider the periodicity required for maximum microbending loss in a graded index
fiber. We assume the fiber has a 50 µm core diameter and maximum core and cladding
refractive indices of 1.48 and 1.46 respectively. We have therefore that $a$ = 25 µm and
that (from Example 8.5) $\Delta$ = 0.0135. Hence from eq. (10.1)

$$\Lambda_c = \frac{2\pi \times 25 \times 10^{-6}}{(2 \times 0.0135)^{1/2}}\ \text{m} = 0.96\ \text{mm}$$
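The arithmetic of Example 10.1 can be checked with a short script. This is only a sketch: the function name `critical_period` is a label chosen here, and the formula is eq. (10.1) as used in the example, with the core radius in metres.

```python
import math


def critical_period(core_radius_m: float, delta: float) -> float:
    """Perturbation period (m) coupling all modes of a parabolic-index fiber.

    Implements Lambda_c = 2*pi*a / sqrt(2*Delta), as used in Example 10.1.
    """
    return 2.0 * math.pi * core_radius_m / math.sqrt(2.0 * delta)


# Values quoted in Example 10.1: 50 um core diameter, Delta = 0.0135.
a = 25e-6
delta = 0.0135
period = critical_period(a, delta)
print(f"Critical period = {period * 1e3:.2f} mm")
```

Because the period scales as $a/\Delta^{1/2}$, a fiber with a smaller core or a larger index difference calls for more finely ridged plates.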



When the modes in a fiber are excited by a coherent source, they are capable of
interfering with each other and thus of producing an interference pattern across the end of the fiber.
The pattern obtained will depend on the phase differences developed between the modes as
they travel along the fiber, which is impossible to predict. Provided there are no
perturbations acting on the system, however, the pattern should remain unchanged. If now the fiber
is slightly flexed in any way, mode coupling will change the distribution of energy amongst
the modes, and hence produce a change in the interference pattern across the fiber end. Of
course, unless there is a significant amount of coupling into lossy modes, there will not be
any great change in the total amount of energy emerging from the fiber. If, however, we
consider only a small portion of the whole area of the fiber end, any change in the interference
pattern as a whole is almost certain to produce quite significant changes in the emerging
energy. Thus, if a detector is so placed as to intercept only a small portion of the total light
emerging from the fiber, its output should vary when there is any deformation of the fiber.
By its very nature, such a detector will be very non-linear, though in some circumstances
this may not be a great disadvantage. For example, by laying the fiber just below ground
level it may be possible to detect the presence of intruders, since their footsteps will cause
deformation of the fiber. All that is required is for the output to trigger an alarm when the
change in the signal exceeds some predetermined level.

The Bragg fiber grating structure (section 8.7.5.2) can be used as a very useful sensing
element. It will be recalled that the grating will reflect radiation of wavelength $\lambda_B$ which
satisfies the equation

$$m\lambda_B = 2n_1\Lambda \qquad (10.2)$$

where $m$ = 1, 2, 3, etc., $\Lambda$ is the periodicity of the grating and $n_1$ the refractive index of the
core. The exact value of the product $n_1\Lambda$ will depend on both temperature and strain within
the fiber. As far as temperature changes are concerned both the grating periodicity and the
refractive index will be affected by temperature and we can write

$$\Delta\lambda_B = \frac{2n_1\Lambda}{m}\left(\frac{1}{n_1}\frac{\mathrm{d}n_1}{\mathrm{d}T} + \frac{1}{\Lambda}\frac{\mathrm{d}\Lambda}{\mathrm{d}T}\right)\Delta T$$

This can be written as

$$\Delta\lambda_B = \lambda_B(\beta + \alpha)\Delta T \qquad (10.3)$$

where $\beta = (\mathrm{d}n_1/\mathrm{d}T)/n_1$ and $\alpha$ is the linear expansion coefficient. Similarly the application of
strain ($\varepsilon$) will affect both the grating spacing and the refractive index (via the photoelastic
effect), and we may write (ref. 10.2)

$$\Delta\lambda_B = \lambda_B(1 - p_e)\Delta\varepsilon \qquad (10.4)$$

where $p_e$ is an effective photoelastic coefficient given by

$$p_e = \frac{n_1^2}{2}\left[(1 - \mu)P_{12} - \mu P_{11}\right]$$

where $P_{11}$ and $P_{12}$ are Pockels coefficients and $\mu$ is Poisson's ratio. Example 10.2 calculates
the resulting sensitivities for silica fiber.

EXAMPLE 10.2 Bragg fiber grating sensitivity

For pure silica the thermal expansion coefficient ($\alpha$) is $5 \times 10^{-7}$ K$^{-1}$ whilst the quantity
$(\mathrm{d}n_1/\mathrm{d}T)/n_1$ (i.e. $\beta$) is $6.8 \times 10^{-6}$ K$^{-1}$. Thus according to eq. (10.3) the wavelength
sensitivity to temperature changes of the fiber grating structure, assuming $\lambda_B$ = 1.55 µm, is

$$\frac{\Delta\lambda_B}{\Delta T} = 1.55 \times 10^{-6}\,(5 \times 10^{-7} + 6.8 \times 10^{-6})\ \text{m K}^{-1} = 0.0113\ \text{nm K}^{-1}$$

As far as sensitivity to strain is concerned we have that $P_{11}$ = 0.126, $P_{12}$ = 0.274 and
$\mu$ = 0.17. It then follows that

$$p_e = \frac{1.46^2}{2}\,[(1 - 0.17) \times 0.274 - 0.17 \times 0.126] = 0.220$$

Thus as far as strain is concerned

$$\frac{\Delta\lambda_B}{\Delta\varepsilon} = 1.55 \times 10^{-6}\,(1 - 0.220) = 1.2 \times 10^{-6}\ \text{m}\ \varepsilon^{-1}\quad\text{or}\quad 1.2 \times 10^{-3}\ \text{nm}\ \mu\varepsilon^{-1}$$
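The sensitivities in Example 10.2 can be reproduced numerically. The sketch below assumes the effective photoelastic coefficient takes the form $p_e = (n_1^2/2)[(1-\mu)P_{12} - \mu P_{11}]$, which is the form implied by the example's arithmetic; the function names are illustrative only. Evaluated with the quoted values, the bracket gives $p_e$ close to 0.22.

```python
def thermal_sensitivity(lambda_b: float, alpha: float, beta: float) -> float:
    """d(lambda_B)/dT in m/K from eq. (10.3): lambda_B * (beta + alpha)."""
    return lambda_b * (alpha + beta)


def effective_photoelastic(n1: float, p11: float, p12: float, mu: float) -> float:
    """Effective photoelastic coefficient p_e (assumed form, see lead-in)."""
    return 0.5 * n1 ** 2 * ((1.0 - mu) * p12 - mu * p11)


def strain_sensitivity(lambda_b: float, p_e: float) -> float:
    """d(lambda_B)/d(strain) in m per unit strain from eq. (10.4)."""
    return lambda_b * (1.0 - p_e)


# Values quoted in Example 10.2 for pure silica at 1.55 um.
lambda_b = 1.55e-6
dl_dT = thermal_sensitivity(lambda_b, alpha=5e-7, beta=6.8e-6)
p_e = effective_photoelastic(n1=1.46, p11=0.126, p12=0.274, mu=0.17)
dl_de = strain_sensitivity(lambda_b, p_e)

print(f"Thermal sensitivity: {dl_dT * 1e9:.4f} nm/K")
print(f"p_e: {p_e:.3f}")
# 1 microstrain = 1e-6 strain, and 1 m = 1e9 nm.
print(f"Strain sensitivity: {dl_de * 1e9 * 1e-6:.2e} nm per microstrain")
```

On these figures a 1 K temperature change shifts the Bragg wavelength by about as much as 9 microstrain, which is why temperature and strain effects have to be separated in practical measurements.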



There are a number of ways in which the Bragg sensor can be 'interrogated' to obtain
a measure of the reflection wavelength $\lambda_B$. For example, if radiation from a tunable laser
is incident on the grating and its output wavelength scanned across the appropriate
wavelength range then strong reflection will be obtained at $\lambda_B$. The magnitude of the
back-reflected radiation is easily monitored using the set-up illustrated in Fig. 10.5. Several such
sensors may be employed at different positions along the fiber provided that the wavelength
ranges associated with the sensors are mutually exclusive and also that the laser scanning
range is sufficiently large. The wavelength of each sensor is determined by correlating the
time at which a reflected pulse is detected to the laser wavelength at that time. In another
measurement technique radiation from a broad-band source is sent down the fiber. Light
at wavelength $\lambda_B$ will be removed from the beam, leading to a 'notch' appearing in the
transmitted spectrum, and either the reflected or transmitted spectrum can be analyzed to obtain
$\lambda_B$. However, the changes in $\lambda_B$ are small and difficult to measure directly except with costly
instruments such as the optical spectrum analyzer. A number of possible measurement
schemes have been proposed, most of which involve matching the wavelength $\lambda_B$ to
the 'resonant' wavelength of some other optical system such as a Fabry-Perot
interferometer.

Bragg grating sensors offer a number of advantages over other types. For example, they
offer a relatively high resolution of strain or temperature, the output is a linear function of
the measurand and they are insensitive to fluctuations in light intensity. In addition they are
relatively easy to fabricate and do not compromise the structural integrity of the fiber.

We know from our discussion of fibers in Chapter 8 that the mode field extends out into
the cladding. If, therefore, over a certain portion of a fiber we have a cladding material which
changes its optical properties (e.g. refractive index or absorption coefficient) in response to
changes in the surroundings, then the mode field will be affected to some extent. Sensors
relying on this basic principle have been made to measure liquid refractive indices and various
ionic concentrations and pH values (ref. 10.3).

FIG. 10.5 Illustration of a technique that can be used to interrogate an array of Bragg fiber diffraction
gratings. The photodiode will only receive a signal when the output of the laser corresponds to one of the
reflection wavelengths ($\lambda_1, \lambda_2, \ldots, \lambda_n$).

FIG. 10.6 Raman scattering spectrum in silica; the scattered frequency differs from the incident frequency
by an amount $\Delta\nu$.



10.1.3 Distributed fiber sensors

Most of the sensors discussed so far have been concerned with the measurement of physi- 
cal parameters at a particular point in space. However, there are a number of situations 
where it is highly desirable to monitor a physical parameter continuously along a length; 
whilst it is always possible to use a large number of point sensors this makes for a very 
cumbersome (and costly) solution. A much better idea is to use a single fiber and to monitor 
some property dependent on the required measurand at the required number of points along 
the fiber. The flexibility of fiber makes it relatively easy to install over a chosen meas- 
urement path and thus allows retrospective fitting. The value of having access to the 
spatial/temporal behaviour of strain and temperature in, for example, large critical
structures such as dams, aircraft, spacecraft, bridges, etc., is clear both from the point of view
of safety monitoring and for a better understanding of behaviour under unexpected 
conditions. 

The principle of the OTDR (section 8.6.6) provides a means of achieving the spatial
resolution; thus the time delay between a launch pulse and a returning pulse is readily converted
into the distance along the fiber to where the returning pulse was generated. In the OTDR
the radiation received originates in Rayleigh scattering, which in silica is relatively
insensitive to temperature. Some early demonstrations used liquid core fibers where the
scattering is much more temperature dependent. However, such fibers are not generally suitable
for practical applications. Another possibility is the use of fibers that have been doped with
rare earth elements (e.g. the Er$^{3+}$-doped fibers used in optical amplifiers), in which some of
the absorption features vary with temperature and hence can influence the gradient of the
OTDR trace. For example, a holmium-doped fiber has shown a temperature sensitivity of
about 1 °C with a spatial resolution of about 4 m.
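The conversion from round-trip delay to position mentioned above can be sketched as follows. The group index of 1.46 is an assumed, typical value for silica rather than a figure from this section, and the function names are illustrative. The factor of 2 accounts for the pulse travelling to the scattering point and back.

```python
C_VACUUM = 2.998e8  # speed of light in vacuum, m/s


def otdr_distance(delay_s: float, group_index: float = 1.46) -> float:
    """Distance (m) along the fiber to the point that produced a return.

    The delay covers the outward and return trips, hence the factor of 2.
    """
    return C_VACUUM * delay_s / (2.0 * group_index)


def pulse_width_for_resolution(resolution_m: float, group_index: float = 1.46) -> float:
    """Launch pulse duration (s) whose round-trip extent equals the resolution."""
    return 2.0 * group_index * resolution_m / C_VACUUM


# A return arriving 10 us after launch, and the pulse width matching 4 m resolution.
distance = otdr_distance(10e-6)
width = pulse_width_for_resolution(4.0)
print(f"Scattering point at about {distance:.0f} m")
print(f"Pulse width for 4 m resolution: about {width * 1e9:.0f} ns")
```

On these assumptions, the 4 m resolution quoted for the holmium-doped fiber corresponds to pulses a few tens of nanoseconds long.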

Other sensors have been developed based on scattering mechanisms which involve the
interaction of the radiation with the vibrations of the lattice. In terms of quantized vibrations
these may be described as photon-phonon interactions. There are two main types of lattice
vibration (or phonon) involved: one in which neighbouring atoms are moving with (almost)
the same phase, giving rise to acoustic phonons; and the other in which neighbouring atoms
are moving almost 180° out of phase, giving rise to optical phonons. Optical phonons can
only exist in materials where the unit cell contains more than one atom. During the
interaction process the photon can either create or absorb a phonon, so that the energy of the
photon is changed. If a phonon is created the photon energy will decrease and hence also
its frequency: in this case we talk of a Stokes shift. When a phonon is absorbed, on the other
hand, the photon energy and frequency will both increase and we have an anti-Stokes shift.
The scattering arising from optical phonons is called Raman scattering, whilst that from
acoustic phonons is called Brillouin scattering. Since optical phonons have energies which
are several orders of magnitude larger than acoustic phonons, the frequency shifts involved
are also correspondingly larger. Another difference is that the magnitude of the shift in the
case of Brillouin scattering depends on the scattering angle of the photon; the frequency shift
attains its maximum value for backward scattering and falls to zero for forward scattering.
Because the material we are mainly concerned with (silica) is amorphous, the phonons have
quite a wide range of frequencies, resulting in a relatively wide frequency spectrum of the
scattered radiation, particularly so for Raman scattering (Fig. 10.6) where a Stokes frequency
shift of about $1.5 \times 10^{13}$ Hz is obtained with a linewidth of a similar magnitude to the shift
itself. In the case of Brillouin scattering a maximum frequency shift of about $10^{10}$ Hz, with
a linewidth of about $3 \times 10^7$ Hz, is observed.

In Raman scattering the ratio of the Stokes to anti-Stokes scattering irradiances, $R_s$,
depends on temperature according to (ref. 10.4)

$$R_s = \left(\frac{\lambda_a}{\lambda_s}\right)^4 \exp\left(\frac{h\Delta\nu}{kT}\right) \qquad (10.5)$$

where $\lambda_s$ and $\lambda_a$ are the wavelengths at which Stokes and anti-Stokes scattering takes place
and $\Delta\nu$ is the difference in frequency between the exciting radiation and anti-Stokes line.
$R_s$ changes by about 0.7% per °C over the range 1 to 100 °C (see Example 10.3). Thus a
measurement of $R_s$ can be used as the basis of a (distributed) temperature sensor. Because
a ratio is involved, the measurement is not affected by changes in irradiance of the probe
beam. One problem with the technique is that Raman scattering is, at the wavelengths
involved here, about a factor $10^{-3}$ weaker than Rayleigh scattering; consequently
multimode fibers are normally used in conjunction with high power semiconductor lasers.
Temperature resolutions of 1-2 °C with spatial resolutions of 1-2 m over distances of
5-10 km have been obtained (ref. 10.5). Both forward and backscattering are possible.
Forward scattering is the easier to implement, since the launch and scattered pulses
emerge from opposite ends of the fiber and pulse launch and detection are therefore
simpler, although pulse timing information has to be coordinated between the two ends
of the fiber.

EXAMPLE 10.3 Temperature sensitivity of Raman scattering sensor

From eq. (10.5) we have

$$\frac{1}{R_s}\frac{\mathrm{d}R_s}{\mathrm{d}T} = -\frac{h\Delta\nu}{kT^2}$$

Taking the Raman shift in silica to be about $1.5 \times 10^{13}$ Hz and assuming that
$T$ = (273 + 50) K = 323 K, we have that the magnitude of the fractional change in $R_s$
for a 1 °C change is

$$\frac{6.6 \times 10^{-34} \times 1.5 \times 10^{13}}{1.38 \times 10^{-23} \times (323)^2} = 6.88 \times 10^{-3}\ \text{per °C or 0.7\% per °C}$$
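The figure of about 0.7% per °C can be checked numerically. The sketch below assumes the usual Boltzmann form for the Stokes to anti-Stokes ratio, $R = (\lambda_a/\lambda_s)^4 \exp(h\Delta\nu/kT)$; the 900 nm pump wavelength and the function names are hypothetical choices made only to give the ratio concrete values. The fractional sensitivity $h\Delta\nu/(kT^2)$ does not depend on the wavelengths.

```python
import math

H = 6.626e-34   # Planck constant, J s
K = 1.381e-23   # Boltzmann constant, J/K
C = 2.998e8     # speed of light, m/s


def stokes_to_antistokes(pump_wavelength: float, shift_hz: float, temp_k: float) -> float:
    """Stokes/anti-Stokes irradiance ratio (assumed Boltzmann form, see lead-in)."""
    nu0 = C / pump_wavelength
    lambda_s = C / (nu0 - shift_hz)   # Stokes line: lower frequency
    lambda_a = C / (nu0 + shift_hz)   # anti-Stokes line: higher frequency
    return (lambda_a / lambda_s) ** 4 * math.exp(H * shift_hz / (K * temp_k))


def fractional_sensitivity(shift_hz: float, temp_k: float) -> float:
    """Magnitude of (1/R) dR/dT, i.e. h*dv / (k*T^2), per kelvin."""
    return H * shift_hz / (K * temp_k ** 2)


shift = 1.5e13            # Raman shift in silica, Hz (from the text)
temperature = 323.0       # K
pump = 900e-9             # hypothetical pump wavelength, m

sens = fractional_sensitivity(shift, temperature)
# Finite-difference check: relative change in the ratio for a 1 K rise.
r0 = stokes_to_antistokes(pump, shift, temperature)
r1 = stokes_to_antistokes(pump, shift, temperature + 1.0)
print(f"Analytic sensitivity: {sens * 100:.2f} % per K")
print(f"Finite difference:    {(r1 / r0 - 1.0) * 100:.2f} % per K")
```

The finite-difference value is negative because the Stokes to anti-Stokes ratio falls as the temperature rises and the anti-Stokes line strengthens.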



Brillouin scattering may also be used, although the technique is rather more complicated
than in the case of the Raman sensor. In the usual configuration light from a CW laser is
launched into one end of a single mode fiber, whilst a train of laser pulses is launched into
the other end. The emission frequency of the pulsed laser is higher than that of the CW laser
by the Brillouin shift. The pulsed laser now acts as a pump beam for stimulated Brillouin
emission, so that the CW beam will increase in irradiance as it passes through the fiber. Now
in fact the Brillouin frequency shift depends on temperature, so that if the temperature of
the fiber is non-uniform then gain will only take place in those parts of the fiber which are
at a particular temperature (which can then be related to the difference between the laser
emission frequencies). If the power of the CW beam is monitored following the launch of
a pump pulse, an increase will be observed whenever Brillouin gain occurs. Positional
information about where the gain is taking place can be obtained from the time delay between
the pump pulse launch and the arrival of the increased gain in the CW beam. If the frequency
difference between the lasers is altered then different temperature regions will be picked out.
Thus by scanning the frequency of one of the lasers the temperature profile along the fiber
can be mapped out. Rather longer lengths of fiber (i.e. several tens of kilometres) can be
employed than in the case of Raman scattering sensors, with similar temperature resolutions
but slightly lower spatial resolutions (≈ 5 m) (ref. 10.6). Unlike Raman scattering, Brillouin
scattering is sensitive to strain as well as temperature and hence simultaneous measurements
of both these quantities are possible.

When it becomes possible to monitor rapidly and accurately the state of a structure it 
is then possible to envisage some sort of feedback mechanism whereby the form or the 
structure is modified to suit the prevailing conditions; thus an aircraft wing may change 
shape to suit varying heights, speeds and turn radii. This is the idea of smart structures 
and the advantages of optical fiber such as low mass, short response time and freedom 
from electromagnetic interference make it a very serious candidate for this type of
situation (ref. 10.7).

10.1.4 Single mode fiber sensors

In single mode fiber sensors, we are mainly dealing with the effects of the external quanti- 
ties to be measured on the phase (or mode velocity) of the light within the fiber. Some sensors 
involve polarization modulation, but these will be discussed later. 

10.1.4.1 Phase-modulated sensors

One obvious way in which we may detect phase changes is to construct an interferometric
system where the phase of the beam through a sensing fiber is compared with that of a
reference beam. The Mach-Zehnder configuration is easily realized using optical fibers
(Fig. 10.7). Light from a semiconductor laser is launched into a single mode optical fiber
and the radiation split and sent into 'sensing' and 'reference' fibers. The radiation from
these two fibers is then mixed and split again at another 3 dB coupler; the output from each
fiber is then detected.
We now consider the form of the output expected from the interferometer. The electric
fields of the two beams at one of the detectors may be written as
$A_s \exp[i(\omega t + \Delta\phi)]$ and $A_r \exp(i\omega t)$, where $\Delta\phi$ represents
the phase difference between the two. In general, $\Delta\phi$ will be made up of a 'static'
differential phase term $\Delta\phi_d$ and a signal term $\phi_s$, where
$\Delta\phi = \Delta\phi_d + \phi_s$. The total field is the sum of these two fields and the
detector output ($R_1$) is proportional to the product of this sum and its own complex
conjugate, that is



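$$R_1 \propto \{A_s \exp[i(\omega t + \Delta\phi)] + A_r \exp(i\omega t)\}\{A_s \exp[-i(\omega t + \Delta\phi)] + A_r \exp(-i\omega t)\}$$

or

$$R_1 \propto A_r^2 + A_s^2 + A_r A_s[\exp(i\Delta\phi) + \exp(-i\Delta\phi)]$$

$$R_1 \propto A_r^2 + A_s^2 + 2A_r A_s \cos\Delta\phi$$

Assuming for simplicity that $A_r = A_s = A$, then

$$R_1 \propto 2A^2[1 + \cos(\Delta\phi)] \qquad (10.6)$$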

This response as a function of $\Delta\phi$ is shown in Fig. 10.8, which illustrates the basic
problem with any interferometric sensor, namely that its output is periodic in the phase
difference. It is also evident that the greatest sensitivity to small variations in signal is
obtained when the system is operating at points halfway between the maximum and minimum
values of $R_1$. These are known as the 'quadrature' points. Midway between these maximum
sensitivity points the sensitivity falls to zero. Thus, even if we could ensure that we started
off operating at a quadrature point it is quite likely that changes in ambient conditions
around the reference arm would cause $\Delta\phi$ to change and hence cause a drift away from
the quadrature point.

FIG. 10.7 Configuration of the optical fiber Mach-Zehnder interferometer.

FIG. 10.8 Relative response of a Mach-Zehnder interferometer as a function of the phase
difference between the two arms ($\Delta\phi$).

So far, we have considered the output from only one of the signal detectors ($R_1$). It may
be shown (see e.g. ref. 10.8) that there is a phase difference of $\pi$ between the beams
falling on the two detectors. Thus, the response of the second detector ($R_2$) may be
written as

$$R_2 \propto 2A^2[1 + \cos(\Delta\phi + \pi)]$$

or

$$R_2 \propto 2A^2[1 - \cos(\Delta\phi)] \qquad (10.7)$$

Thus

$$(R_1 - R_2) \propto 4A^2 \cos(\Delta\phi)$$

In effect, therefore, by taking the difference between the two output signals we double
the sensitivity of the interferometer. There are also advantages to be gained when the
amplitudes of the beams from the signal and reference arms are not equal (see Problem 10.4).
Problems associated with phase drift in the reference arm still remain, and we now examine
ways of dealing with them.
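
The behaviour described by eqs (10.6) and (10.7) can be checked numerically. The sketch below is illustrative only: the function names are hypothetical, the constants of proportionality are set to unity, and equal arm amplitudes are assumed. It evaluates both detector outputs and estimates the slope of the response at a quadrature point and at a maximum.

```python
import math

def detector_outputs(delta_phi, amplitude=1.0):
    """Return (R1, R2) for equal arm amplitudes, following eqs (10.6) and (10.7)."""
    r1 = 2 * amplitude**2 * (1 + math.cos(delta_phi))
    r2 = 2 * amplitude**2 * (1 - math.cos(delta_phi))
    return r1, r2

def differential_output(delta_phi, amplitude=1.0):
    """Difference R1 - R2, which equals 4 A^2 cos(delta_phi)."""
    r1, r2 = detector_outputs(delta_phi, amplitude)
    return r1 - r2

def slope(func, x, step=1e-6):
    """Central-difference estimate of d(func)/dx, used as a measure of sensitivity."""
    return (func(x + step) - func(x - step)) / (2 * step)

if __name__ == "__main__":
    for label, phase in [("quadrature point", math.pi / 2), ("maximum", 0.0)]:
        single = slope(lambda p: detector_outputs(p)[0], phase)
        both = slope(differential_output, phase)
        print(f"{label}: dR1/dphi = {single:.3f}, d(R1 - R2)/dphi = {both:.3f}")
```

The slope of the differential signal at quadrature is twice that of a single detector, and both slopes vanish at the maxima and minima of the response.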

The most straightforward technique is known as the 'active phase tracking homodyne' 
scheme. This requires that we control the phase of the reference arm. One of the simplest 
ways of doing this is to wrap the fiber around a cylinder of a piezoelectric material. When 
a voltage is applied to the cylinder it expands radially, thus stretching the fiber and hence 
inducing a phase change. If this induced phase change is written as $\Delta\phi_m$, we may
hold the system at the quadrature point by requiring that
$\Delta\phi_m + \Delta\phi_d = (2m + 1)\pi/2$. In the absence of any modulation of the sensing
arm, this may be achieved by making the differential signal part of a feedback loop connected
to the reference arm modulator, and then ensuring that the differential signal remains close
to zero (Fig. 10.9).

In the presence of a signal, we cannot use the differential signal directly in the feedback 
loop in this way, or the signal information will also be cancelled out. However, it is likely
that any ambient modulation of the reference arm will take place at a lower frequency than
that of the signal, so that the difference signal ($R_1 - R_2$) integrated over a time long
compared with the signal modulation but short compared with the ambient modulation can often
provide a suitable feedback signal. 

FIG. 10.9 Active phase-tracking homodyne detection using either the low gain-bandwidth
product (GBP) mode or the high GBP mode. In the latter case, an integrating circuit is not
present in the feedback electronics.

The signal obtained before the feedback electronics will now be the same as in a 'perfect'
system operating at the quadrature point and with no reference arm phase drift. Such a mode
of operation is known as a low gain-bandwidth product (GBP) mode. One problem with this
is that the output is only linear in $\Delta\phi$ when $\Delta\phi \ll \pi$. In addition, phase
excursions greater than $2\pi$ can lead to ambiguities because of the periodicity of the
output. A way of avoiding these difficulties is to operate the above system in a high GBP
mode. That is, we remove the integrating circuit and operate the feedback circuit at
frequencies high enough to include the signal frequency. At first sight, this would seem to
obliterate the signal entirely; however, it is still present in the signal applied to the
phase modulator in the reference arm. We may now write for operation at the quadrature point

$$\Delta\phi_m + \Delta\phi_d + \phi_s = (2m + 1)\frac{\pi}{2}$$

or

$$\Delta\phi_m = (2m + 1)\frac{\pi}{2} - (\Delta\phi_d + \phi_s)$$



Assuming therefore that $\Delta\phi_m$ is proportional to the feedback voltage applied to the
modulator, the feedback signal is linear in ($\Delta\phi_d + \phi_s$). This linearity will
extend over many $2\pi$ phase changes, the only limitation being the phase range that can be
covered by the modulator itself. The required phase signal $\phi_s$ may be separated out
using electrical filters since $\phi_s$ usually oscillates at a higher frequency than
$\Delta\phi_d$.
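
A toy simulation may help to show why the modulator phase carries the signal in the high GBP mode. It is a sketch under stated assumptions, not a model of any real feedback circuit: the loop is a simple discrete update with a hypothetical gain per sample, the differential output is normalized to cos(total phase), and the drift and signal waveforms are invented for illustration.

```python
import math

def track_quadrature(drift, signal, gain=0.5):
    """Hold the interferometer at quadrature with a simple discrete feedback loop.

    drift, signal: sequences of phase samples (rad) for the static and signal terms.
    gain: hypothetical loop gain per sample (0 < gain < 2 keeps this toy loop stable).
    Returns the modulator phase after each update and the total phase at each sample.
    """
    modulator_phase = 0.0
    modulator_history, total_history = [], []
    for static_term, signal_term in zip(drift, signal):
        total = static_term + signal_term + modulator_phase
        total_history.append(total)
        # Normalized differential output (R1 - R2) / 4A^2 = cos(total); zero at quadrature.
        error = math.cos(total)
        modulator_phase += gain * error
        modulator_history.append(modulator_phase)
    return modulator_history, total_history

if __name__ == "__main__":
    samples = 5000
    drift = [0.001 * k for k in range(samples)]  # slow ambient drift
    signal = [0.2 * math.sin(2 * math.pi * k / 50) for k in range(samples)]
    modulator, total = track_quadrature(drift, signal)
    worst = max(abs(t - math.pi / 2) for t in total[100:])
    print(f"largest deviation from quadrature after settling: {worst:.3f} rad")
    print(f"modulator phase swing: {max(modulator) - min(modulator):.2f} rad")
```

In this sketch the total phase stays close to the quadrature value while the modulator phase follows the sum of drift and signal over a range of several radians, so the signal would be recovered from the modulator drive by filtering.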

The active phase tracking homodyne scheme outlined above is easy to implement and 
usually extremely linear in operation. There are a number of drawbacks, however. For 
example, the scheme uses an electrically active element which may require several hundred 
volts for the piezoelectric element. This may well reduce the sensor's usefulness as an
electrically 'safe' device. Furthermore, although the phase range is much in excess of $2\pi$ it is
still limited, and large phase excursions may necessitate resetting the system with conse- 
quent loss of information. 

Many other schemes have been implemented to overcome these two basic difficulties. 
Nearly all involve a considerable increase in complexity and none is really ideal. (There is 
insufficient space to consider them here, and the interested reader may consult ref. 10.9 for 
more details.) 

We turn now to the effects which various quantities to be measured have on the phase of
the radiation in a single mode fiber. For simplicity, we assume that the mode behaves like
a ray travelling down the centre of the fiber with velocity $c/n_1$. Thus the phase (in
radians) associated with a length $L$ of fiber may be written as

$$\phi = \frac{2\pi n_1 L}{\lambda_0}$$

where $\lambda_0$ is the vacuum wavelength. If $X$ represents the quantity to be measured,
then the change in $\phi$ caused by a change $\Delta X$ in $X$ may be written as

$$\Delta\phi = \left(\frac{2\pi L}{\lambda_0}\frac{\mathrm{d}n_1}{\mathrm{d}X} + \frac{2\pi n_1}{\lambda_0}\frac{\mathrm{d}L}{\mathrm{d}X}\right)\Delta X \qquad (10.8)$$

Because we have assumed that the internal ray angles ($\theta$) are 90°, any changes in the
fiber core radius ($a$) do not affect $\phi$. However, even if $\theta < 90°$, it can be shown
(see Problem 10.5) that $\phi$ is much less sensitive to changes in $a$ than it is to changes
in $L$ and $n_1$. Consequently, we may safely neglect the dependence of $\phi$ on $a$. We now
estimate the sensitivity of $\phi$ to variations in axial force, pressure and temperature.
An axially applied force, $F$, will extend the fiber thus changing $L$ (it also changes
$n_1$, but we ignore this). If Young's modulus for the fiber is $E$, then we have

$$\frac{\Delta L}{L} = \frac{F}{AE}$$

where $A$ is the fiber cross-sectional area. Thus

$$\Delta\phi_F = \frac{2\pi n_1 L F}{\lambda_0 A E} \qquad (10.9)$$

For silica, $E = 2 \times 10^{11}$ Pa and, taking an external fiber diameter of 100 µm, a core
refractive index of 1.45 and light of wavelength 1 µm, eq. (10.9) gives

$$\Delta\phi_F = \frac{2\pi \times 1.45}{10^{-6} \times \pi \times (10^{-4})^2/4 \times 2 \times 10^{11}} = 6 \times 10^{3}\ \mathrm{rad\ N^{-1}\ m^{-1}}$$

If the fiber is subject to a uniform radial pressure $P$, the fiber radius will decrease and
its length will increase. We neglect the effects of any radius change, but the length change
is then given by

$$\frac{\Delta L}{L} = \frac{2\nu P}{E}$$

where $\nu$ is Poisson's ratio, and hence

$$\Delta\phi_P = \frac{4\pi n_1 L \nu P}{\lambda_0 E} \qquad (10.10)$$

For silica, $\nu \approx 0.2$ and hence taking the same fiber parameters as above we obtain

$$\Delta\phi_P = \frac{4\pi \times 1.45 \times 0.2}{10^{-6} \times 2 \times 10^{11}} = 1.8 \times 10^{-5}\ \mathrm{rad\ Pa^{-1}\ m^{-1}}$$
Both refractive index and fiber length are affected by temperature $T$, and so we may write

$$\Delta\phi_T = \frac{2\pi L}{\lambda_0}\left(\frac{\mathrm{d}n_1}{\mathrm{d}T} + n_1\alpha\right)\Delta T \qquad (10.11)$$

where $\alpha$ is the linear expansivity.
For a typical fiber

$$\frac{\mathrm{d}n_1}{\mathrm{d}T} = 7 \times 10^{-6}\ \mathrm{K^{-1}} \quad\text{and}\quad \alpha = 5 \times 10^{-7}\ \mathrm{K^{-1}}$$

Again taking a wavelength of 1 µm and a core refractive index of 1.45 we have

$$\Delta\phi_T = \frac{2\pi}{10^{-6}}(7 \times 10^{-6} + 1.45 \times 5 \times 10^{-7}) = 49\ \mathrm{rad\ K^{-1}\ m^{-1}}$$

In this case, the thermally induced change in the refractive index has a much larger effect
than does the thermal expansion term. We may also note that in practical terms the
temperature sensitivity is extremely high when compared with the effects of pressure and axial
force. Assuming that the interferometer can be operated with a resolution of $10^{-6}$ rad, then
the corresponding temperature sensitivity is $2.3 \times 10^{-8}$ K. It would appear that we
have the potential for a very accurate thermometer. Unfortunately, every part of the
interferometer, including the reference arm, has the same high sensitivity. To maintain the
whole of the reference arm at the same temperature to within $10^{-8}$ K is a formidable task.
In fact, the high sensitivity to temperature remains a difficulty when measuring other
parameters and often limits the sensitivities that can be achieved.
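
The three numerical estimates above can be reproduced with a few lines of code. The sketch below simply evaluates eqs (10.9), (10.10) and (10.11) per unit length and per unit force, pressure or temperature change, using the parameter values quoted in the text; the constant and function names are illustrative.

```python
import math

WAVELENGTH = 1e-6        # m
CORE_INDEX = 1.45
YOUNGS_MODULUS = 2e11    # Pa, value quoted in the text for silica
POISSON_RATIO = 0.2
FIBER_DIAMETER = 100e-6  # m, external diameter
DN_DT = 7e-6             # K^-1, thermal coefficient of the refractive index
EXPANSIVITY = 5e-7       # K^-1, linear expansivity

def phase_per_newton():
    """Eq. (10.9) per metre of fiber and per newton of axial force."""
    area = math.pi * FIBER_DIAMETER**2 / 4
    return 2 * math.pi * CORE_INDEX / (WAVELENGTH * area * YOUNGS_MODULUS)

def phase_per_pascal():
    """Eq. (10.10) per metre of fiber and per pascal of radial pressure."""
    return 4 * math.pi * CORE_INDEX * POISSON_RATIO / (WAVELENGTH * YOUNGS_MODULUS)

def phase_per_kelvin():
    """Eq. (10.11) per metre of fiber and per kelvin."""
    return 2 * math.pi / WAVELENGTH * (DN_DT + CORE_INDEX * EXPANSIVITY)

if __name__ == "__main__":
    print(f"axial force:  {phase_per_newton():.2e} rad per N per m")
    print(f"pressure:     {phase_per_pascal():.2e} rad per Pa per m")
    print(f"temperature:  {phase_per_kelvin():.1f} rad per K per m")
```

Changing a single parameter, for example the fiber diameter, shows directly how strongly each sensitivity depends on the fiber geometry and material constants.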

Quantities other than force, pressure and temperature can be measured, but usually by in- 
direct means (rotation sensing is an exception which will be dealt with separately). For 
example, attaching the sensing fiber to a magnetostrictive element (whose dimensions change 
in the presence of a magnetic field) enables magnetic fields to be measured. One problem 
with this technique is that such materials do not usually respond linearly to a magnetic field. 
Typical sensitivities are about $10^{-7}$ T m$^{-1}$.

The system limitations on sensitivity are rather too complex to go into in any detail here. 
The ultimate limits will be determined by shot noise at the detectors (see section 7.3.3.2). 
Generally speaking, in fiber interferometers using low-powered lasers (i.e. 1 mW), shot noise 
considerations limit the minimum detectable phase shift to $10^{-6}$ to $10^{-7}$ rad Hz$^{-1}$. Other factors
that may be important are intensity fluctuations in the laser output and laser frequency jitter. 
The latter will only give rise to a noise signal if the optical path lengths in the two arms of 
the interferometer are unequal (if they were exactly equal, of course, the phase difference would 
always be zero). Although it may not always be practicable to have nearly equal arm lengths, 
to achieve $10^{-6}$ rad sensitivity requires that the path differences be less than about 1 mm.

10.1.4.2 Fiber optic gyroscope 

The fiber optic gyroscope relies on the Sagnac effect, that is the phase shift induced between 
two light beams travelling in opposite directions round a fiber coil when the coil is rotating 
about an axis perpendicular to the plane of the coil. To calculate the phase shift expected, 
we assume we have a single circular turn of fiber of radius R. It turns out that we must also 
assume that both light beams are effectively travelling in a vacuum. If there is no rotation, 
then both beams will return to their starting points in a time $t$, where $t = 2\pi R/c$. If,
however, the ring is rotating in a clockwise direction at a rate of $\Omega$ rad s$^{-1}$, then
the counterclockwise beam will arrive at its starting point sooner since the effective
velocity of the beam will be $c + R\Omega$, and the time taken, $t'$, will be given by

$$t' = \frac{2\pi R}{c + R\Omega}$$

Similar arguments for the clockwise beam give a transit time of $t''$, where

$$t'' = \frac{2\pi R}{c - R\Omega}$$

The difference between the two transit times $\Delta t_s$ ($= t'' - t'$) is then

$$\Delta t_s = 2\pi R\left[\frac{1}{c - R\Omega} - \frac{1}{c + R\Omega}\right] = \frac{4\pi\Omega R^2}{c^2 - R^2\Omega^2}$$

Since $c^2 \gg R^2\Omega^2$,

$$\Delta t_s = \frac{4\Omega A}{c^2}$$

where $A = \pi R^2$ is the ring area.
In terms of a phase shift

$$\phi_s = 2\pi\,\Delta t_s\,\nu$$

where $\nu$ is the frequency of the light, or

$$\phi_s = \frac{8\pi\Omega A}{c\lambda_0}$$

This expression is independent of the medium in which the light is travelling. Most optical
fiber Sagnac interferometers are constructed with many turns ($N$ say) wrapped round a
circular former. We then have

$$\phi_s = \frac{8\pi N\Omega A}{c\lambda_0} \qquad (10.12)$$



EXAMPLE 10.4 Phase shift in a Sagnac gyroscope 



Suppose we have 1000 turns of fiber on a former of 0.1 m radius; we may calculate
the phase shift induced between two counter-rotating beams by the earth's rotation.
The earth's rotation rate is about 15° per hour or $15\pi/(180 \times 3600)$ rad s$^{-1}$.
Assuming that $\lambda_0 = 1$ µm, eq. (10.12) then gives a phase shift of

$$\phi_s = \frac{8\pi \times 15\pi \times 10^3 \times \pi(0.1)^2}{180 \times 3600 \times 10^{-6} \times 3 \times 10^8} = 1.9 \times 10^{-4}\ \mathrm{rad}$$
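
The arithmetic of this example can be repeated in code, which also makes it easy to try other coil sizes. The sketch below evaluates the multi-turn Sagnac phase shift of eq. (10.12); the function name and argument names are illustrative.

```python
import math

def sagnac_phase(turns, radius_m, rotation_rate_rad_s, wavelength_m, c=3.0e8):
    """Sagnac phase shift (rad) for a circular coil of the given number of turns."""
    area = math.pi * radius_m**2
    return 8 * math.pi * turns * rotation_rate_rad_s * area / (c * wavelength_m)

if __name__ == "__main__":
    earth_rate = math.radians(15.0) / 3600.0  # 15 degrees per hour in rad/s
    phase = sagnac_phase(turns=1000, radius_m=0.1,
                         rotation_rate_rad_s=earth_rate, wavelength_m=1e-6)
    print(f"phase shift from the earth's rotation: {phase:.2e} rad")
```

Because the phase shift scales with the product of the number of turns and the coil area, doubling the coil radius at a fixed number of turns raises the phase shift fourfold.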



FIG. 10.10 Basic layout of the Sagnac fiber interferometer.

The basic arrangement of the Sagnac fiber interferometer is shown in Fig. 10.10. The form
of the output signal from the detector as a function of the rotation rate (and hence the phase
change) will again be as shown in Fig. 10.8. At low rotation rates, the sensitivity will be
very low and will reach maximum values at the quadrature points (i.e. when
$\phi_s = (2m + 1)\pi/2$). Another problem to be surmounted involves what is known as
'reciprocity'. This concerns the necessity to ensure that the two counterpropagating beams
travel absolutely identical paths, since any path difference is a potential source of phase
noise. In fact, the simple arrangement shown in Fig. 10.10 is not reciprocal; the clockwise
beam is reflected twice at the beam splitter whereas the counterclockwise beam is transmitted
twice through it. The reciprocity requirements can be very stringent. For example, in a system
capable of measuring rotation rates as low as $10^{-4}$ degrees per hour the paths must be
reciprocal to 1 part in
10 ! Problems also arise with circularly cored fibers since they are not very good at main- 
taining polarization. Thus, although the two counterpropagating beams with identical polar- 
izations may be launched into the fibers, energy may be coupled from one polarization mode 
into the orthogonal one as the beams traverse the fiber. The energy in the orthogonal modes 
will not be able to interfere with the original beams and a reduction in output will result which 
may be interpreted as a phase signal. 

Although it might be expected that a highly coherent source would be needed in any
interferometric system, this is not always the case. For example, in the present system any
scattering within the fiber (e.g. Rayleigh scattering) will give rise to spurious beams, which will
be able to interfere with the two main beams at the detector if the coherence length of the 
radiation is of the order of the fiber length. With a low coherence source, on the other hand,
only those beams which have nearly identical path lengths (such as the two main beams) 
can interfere. A 'superluminescent' diode (section 5.10.2.7) is often used as a source in fiber 
gyros. This is basically a semiconductor laser into which is introduced sufficient loss to 
prevent full laser action. It acts as a high radiance LED. However, because of its laser-like
structure it is able to couple significantly more radiation into single mode fibers than can
an ordinary LED. 

It can be appreciated that, with all these many problems to be overcome, the fiber gyro 
may never be capable of the highest levels of performance. Its place may well lie in lower 
cost systems of moderate resolution. 

10.1.4.3 Polarimetric sensors

In polarimetric sensors, the two beams in the interferometer are provided by the two ortho- 
gonally polarized modes in a Hi-Bi fiber (section 8.7.5.1). Because of the nature of such 
fibers, there are two fixed directions relative to the fiber along which the polarization vectors 
must lie. These are referred to as the 'fast' and the 'slow' axes, depending on the relative 
mode velocities. If we illuminate the fiber with light polarized at 45° to these directions 
(Fig. 10.11), then both modes will be excited equally. An increasing phase difference will
then develop as the modes travel down the fiber. The polarimetric sensor depends on the
effect of external parameters on this phase difference. One problem is how to get the two
modes to interfere, since they are orthogonally polarized. This is usually achieved with a
combination of a Soleil-Babinet compensator and a Wollaston prism (Fig. 10.11). The
former is a variable phase plate and it is set to introduce a phase difference of $\pi/2$
between the slow and fast modes. The amplitudes of the two beams can now be written as

$$A_f \exp(i\omega t) \quad\text{and}\quad A_s \exp[i(\omega t + \pi/2 + \Delta\phi_p)]$$

where $\Delta\phi_p$ is the phase difference between the beams before passing through the
compensator. For simplicity, we write these two amplitudes as $A'_f$ and $A'_s$.

FIG. 10.11 In polarimetric sensors, light entering the sensing fiber has its polarization set
at 45° to the 'fast' and 'slow' axes, as in (a). The emerging light is processed as shown in (b)
by passing it through a Soleil-Babinet compensator (SBC) and then through a Wollaston prism
(WP). The latter separates the two orthogonally polarized beams which then fall onto detectors
d$_1$ and d$_2$. The amplitudes of the output signals may be obtained by resolving the
amplitudes as in (c).

A Wollaston prism is a device that splits a beam into two physically separate components
having orthogonal polarization. In the present instance, the prism is set to resolve the
emergent light into directions at 45° to the fast and slow directions. Thus the amplitudes of
the two beams from the prism are given by $(A'_s + A'_f)/\sqrt{2}$ $(= A_1)$ and
$(A'_s - A'_f)/\sqrt{2}$ $(= A_2)$ (Fig. 10.11). These signals are then allowed to fall onto two
detectors d$_1$ and d$_2$. Expanding the expressions for $A'_s$ and $A'_f$, and assuming for
simplicity that $A_s = A_f = A$, we obtain

$$A_1 = \frac{A}{\sqrt{2}}\left\{\exp(i\omega t) + \exp\left[i\left(\omega t + \frac{\pi}{2} + \Delta\phi_p\right)\right]\right\}$$

The beam irradiance $I_1$ $(= A_1 \times A_1^*)$ is then

$$I_1 = \frac{A^2}{2}\left\{\exp(i\omega t) + \exp\left[i\left(\omega t + \frac{\pi}{2} + \Delta\phi_p\right)\right]\right\}\left\{\exp(-i\omega t) + \exp\left[-i\left(\omega t + \frac{\pi}{2} + \Delta\phi_p\right)\right]\right\}$$

When multiplied out this simplifies to

$$I_1 = A^2[1 + \cos(\pi/2 + \Delta\phi_p)] = A^2[1 - \sin(\Delta\phi_p)]$$

Similarly, we may show that for the other beam the irradiance is given by

$$I_2 = A^2[1 + \sin(\Delta\phi_p)]$$

The interferometer output is then most conveniently processed as

$$\frac{I_2 - I_1}{I_2 + I_1} = \sin(\Delta\phi_p)$$
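
The algebra can be verified numerically with complex amplitudes. The sketch below assumes the sign convention used for the amplitudes above (a phase of $+\pi/2$ on the slow mode, equal mode amplitudes) and drops the common factor $\exp(i\omega t)$, which cancels in the irradiances; with that convention the normalized difference taken as $(I_2 - I_1)/(I_1 + I_2)$ reproduces $\sin(\Delta\phi_p)$. The function names are illustrative.

```python
import cmath
import math

def detector_irradiances(delta_phi_p, amplitude=1.0):
    """Irradiances on the two detectors after the compensator and Wollaston prism."""
    fast = amplitude + 0j  # common time factor exp(i w t) omitted
    slow = amplitude * cmath.exp(1j * (math.pi / 2 + delta_phi_p))
    a1 = (slow + fast) / math.sqrt(2)  # component along one 45-degree axis
    a2 = (slow - fast) / math.sqrt(2)  # component along the orthogonal axis
    return abs(a1) ** 2, abs(a2) ** 2

def normalized_output(delta_phi_p):
    """Difference of the two irradiances divided by their sum."""
    i1, i2 = detector_irradiances(delta_phi_p)
    return (i2 - i1) / (i1 + i2)

if __name__ == "__main__":
    for phase in (0.0, 0.05, 0.5, 1.5):
        print(f"delta_phi_p = {phase:4.2f} rad: output = {normalized_output(phase):+.4f}, "
              f"sin = {math.sin(phase):+.4f}")
```

Dividing by the sum removes the dependence on the overall light level, so fluctuations in source power do not appear in the processed output.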



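For small values of $\Delta\phi_p$, the output is then proportional to $\Delta\phi_p$. The
interferometer will of course only respond to the effect of external forces on the intrinsic
birefringence $B$ of the fiber. If we put $B = n_x - n_y$, then we have

$$\Delta\phi_p = \frac{2\pi L B}{\lambda_0}$$

If $X$ represents the quantity to be measured then we may write

$$\frac{\mathrm{d}\Delta\phi_p}{\mathrm{d}X} = \frac{2\pi}{\lambda_0}\left(B\frac{\mathrm{d}L}{\mathrm{d}X} + L\frac{\mathrm{d}B}{\mathrm{d}X}\right) \qquad (10.13)$$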



Here, the first term on the right-hand side represents the effect of length change, whilst
the second represents the change in birefringence. By comparing eq. (10.13) with eq. (10.8),
we can see that the former term is much smaller than the corresponding term in a conventional
two-fiber interferometer, the ratio being in fact $B/n_1$ or $(n_x - n_y)/n_1$. For example,
taking a fiber with a beat length of 2 mm, $n_x - n_y \approx 5 \times 10^{-4}$ (see eq. 8.48) and
hence $(n_x - n_y)/n_1 \approx 3 \times 10^{-4}$. For fibers with a longer beat length, the
factor will be correspondingly smaller. Thus we are justified in neglecting the first term on
the right-hand side of eq.
(10.13) in comparison with the second. 

We saw earlier (section 8.7.5.1) that there are basically two types of high birefringence 
fiber: elliptically cored and internally stressed. The former tend to be relatively insensitive
to pressure, axial stress and temperature, since none of these parameters can change the core
ellipticity to any extent. Internally stressed fibers are produced by deliberately
incorporating elements into the cladding which have different thermal expansion coefficients,
thus producing an internal stress when they cool down during manufacture. Thus the internal
stress, and hence $B$, is expected to be temperature dependent. Equally, axial strain affects
the anisotropic stress within the fiber, again affecting $B$. External pressure, on the other
hand, being isotropic with respect to the anisotropic strain, has little effect on $B$.
Obviously the sensitivities to external forces are very dependent on the exact type of fiber
used. However, for typical internally stressed fibers these are as follows: temperature,
3 rad K$^{-1}$ m$^{-1}$; pressure, $10^{-8}$ rad Pa$^{-1}$ m$^{-1}$; and strain, $10^{6}$ rad.

It may seem perverse to develop sensors with deliberately reduced sensitivities. However,
we have seen earlier that the high sensitivity of the conventional two-fiber interferometer,
particularly with regard to temperature, can be something of an embarrassment. One
interesting possibility is that of a remote polarimetric sensor. This consists of two parts:
(a) a 'down' lead fiber and (b) a sensing fiber. Both fibers are of identical types except that
they are fusion spliced together so that their fast and slow directions are at 45°
(Fig. 10.12). The end of the sensing fiber is silvered to reflect light back down the fiber.
In operation, linearly polarized light is launched into only one of the two fiber modes (the
'input' mode). When the light reaches the splice, energy will be launched into both
polarization modes of the sensing fiber. After traversing the sensing fiber twice, the light
will arrive back at the splice and will then in general be coupled into the 'unused' mode of
the input fiber (the 'output' mode). Back at the launch end, the light in this mode can be
separated out and monitored. Any change in the differential path length in the sensing fiber
will give rise to a change in the energy launched into the output mode.

10.1.4.4 Polarization rotation sensors

All the single mode fiber sensors considered so far have relied on the quantity to be
measured affecting the phase of a mode within a fiber. Here, we consider the possibility of
altering the angle of polarization of a mode. In section 3.7.1 we saw that such a rotation can
take place in a medium in the presence of a magnetic field (Faraday rotation). Obviously, in
the light of the discussion in section 8.7.5.1, we require a fiber that is as isotropic as
possible, since any anisotropies can cause coupling between the orthogonally polarized modes.
Unfortunately the Faraday effect is comparatively weak, and it is only possible to measure
relatively large magnetic fields. One area where such a sensor has proved valuable is in the
monitoring of large currents in electricity generating stations.

FIG. 10.12 Remote polarimetric sensor (labels: down lead fiber, sensing fiber, silvered end,
fast and slow axes, input mode polarization, output mode polarization). The separation shown
here between the down lead fiber and the sensing fiber is for pictorial convenience; normally
the two are fused together.



Suppose we consider a single turn of fiber of radius $r$ which passes round a wire carrying
a current $I$ A (Fig. 10.13). Ampere's circuital theorem gives $H \times 2\pi r = I$, and hence
$H = I/2\pi r$. The amount of polarization rotation is, from eq. (3.18), given by
$\theta_r = VB\,2\pi r$. Since $B = \mu_0\mu_r H$, we have $\theta_r = \mu_0\mu_r VI$, where
$\mu_r$ is the relative permeability of the fiber and $V$ is its Verdet constant. For $n$ turns
of fiber, the amount of rotation will be $n$ times greater, that is

$$\theta_r = \mu_0\mu_r nVI \qquad (10.14)$$

It will be noticed that this result is independent of $r$, and indeed in general $\theta_r$ is
independent of the size or shape of the loop and the position of the conductor within the
loop. This is a useful feature, since it indicates that the device will be insensitive to any
vibrations. As noted earlier, the sensitivity of this device is low, and, as Example 10.5
shows, resolutions of some tens of amperes are all that can be expected.



EXAMPLE 10.5 Faraday rotation sensor sensitivity 

For silica, $V = 4$ rad m$^{-1}$ T$^{-1}$ and $\mu_r = 1$; hence for a 10 turn fiber coil and
with $I = 30$ A the polarization rotation given by eq. (10.14) is

$$\theta_r = 4\pi \times 10^{-7} \times 10 \times 4 \times 30 \times 180^\circ/\pi = 0.09^\circ$$

Normally, a resolution of about 0.1° in the polarization angle is possible, which is just
the magnitude obtained here with a 30 A current.

FIG. 10.13 Geometry for investigating the Faraday rotation induced in an optical fiber passing
around a current-carrying conductor.
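
The numbers in Example 10.5 can be reproduced, and inverted to give the current resolution, with the short sketch below. It evaluates eq. (10.14) using the Verdet constant and relative permeability quoted in the example; the function names and the 0.1° default are illustrative.

```python
import math

MU_0 = 4e-7 * math.pi  # permeability of free space, H/m

def faraday_rotation_deg(turns, current_a, verdet=4.0, mu_r=1.0):
    """Polarization rotation in degrees from eq. (10.14); verdet in rad per m per T."""
    return math.degrees(MU_0 * mu_r * turns * verdet * current_a)

def current_resolution_a(turns, angle_resolution_deg=0.1, verdet=4.0, mu_r=1.0):
    """Smallest current (A) giving a rotation equal to the angular resolution."""
    return math.radians(angle_resolution_deg) / (MU_0 * mu_r * turns * verdet)

if __name__ == "__main__":
    print(f"rotation for 10 turns and 30 A: {faraday_rotation_deg(10, 30.0):.3f} degrees")
    print(f"current resolved at 0.1 degrees with 10 turns: {current_resolution_a(10):.1f} A")
```

Since the rotation is proportional to the number of turns, a coil with ten times as many turns would, on this simple model, resolve a current ten times smaller.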



In this section on optical fiber sensors, we have attempted to give an overall view of the
many different types of sensor that have been proposed and of the basic physical principles
behind them. Inevitably, in view of the limited space available, a comprehensive survey has
not been possible. For a more detailed discussion, the interested reader may consult some
of the many review articles and books that have been written (ref. 10.10). It should perhaps
be emphasized that as yet few of these sensors have achieved widespread commercial success.

We move now to an area of application for optical fibers that has been making unspectacular
but steady progress for many years, that is the use of bundles of fibers either for
illumination or for imaging purposes.



10.2 Light-guiding fibers

Apart from their use in fiber optic communications, transparent optical fibers have been used
for many years in a wide range of light-guiding applications in science, medicine and
industry (ref. 10.11). Polished acrylic rods were first used for illumination purposes in the
late 1920s. Most current applications, however, have arisen following the dramatic reduction in
fiber losses resulting from Van Heel's suggestion that the fiber be provided with a cladding
to reduce surface losses at each internal reflection within the fiber (ref. 10.12). Light
propagates along these fibers in a zigzag manner in precisely the same way as we have
described in section 8.3.

The commonest light-guiding fibers consist of a core and cladding as in communications 
fibers. The transmission distances involved are quite short, however, so there is not the same 
importance attached to having ultralow loss fibers. Usually, an all-glass (i.e. borosilicate) 
type of fiber is used with losses of the order of 500 dB km$^{-1}$. There is a sufficiently
wide range of compatible glasses to enable fibers with a wide range of numerical apertures to
be fabricated. The numerical aperture (NA) is given by eq. (8.23), although in the present case
the amount of light which can be accepted by the fiber through the generation of skew rays
is not unimportant. With a Lambertian source, the light-gathering efficiency may be about
50% higher than that predicted by eq. (8.23), which, of course, applies only to meridional
rays.

There are very few applications of single optical fibers (i.e. monofibers), but a brief
consideration of these reveals the properties of fiber light guides. In general, the fiber can
be regarded as an optical system which uses the light entering it to produce an output with
the same solid angle but with reduced irradiance. Thus, a simple application is the facility
of having a light source remote from, for example, a hazardous environment. Similarly, a
fiber may be used to collect light from only part of a source in terms of angular or spatial
dimensions. Monofibers can be used in this type of application subject to limitations caused
by the dimensions of the fiber and the requirements for flexibility. The bending radius at
which a fiber breaks depends on the fiber diameter, and as a rough guide a fiber of 30 µm
diameter will break at a bending radius of about 1 mm, while a fiber of 2 mm diameter may be
taken as being non-flexible. If the transfer of light can be achieved without a requirement
for flexibility, then larger diameter fibers can be used. Fibers up to 10 mm diameter can be
bent, if required, by softening them in a flame, with negligible effect on their transmission
characteristics. Such fibers are used for illuminating otherwise inaccessible regions where
space is at a premium. They may, for example, be incorporated into the wall of a vessel to
illuminate its inside, as illustrated in Fig. 10.14. If there is a requirement for the fiber
to be flexible, either to allow for relative movement between the source and the point of
application or because the path between these is not a straight line, then either small
diameter monofibers or multifiber bundles may be used. It is quite straightforward to draw
and assemble many thin fibers with diameters of about 50 µm into a bundle or light guide. In
these devices, the ends of the fibers are bonded together leaving them free to flex over most
of their length. The flexible portion is protected by, for example, a plastic sheath which is
bonded to a ferrule at each end of the bundle. Because of the cladding, each individual fiber
behaves independently of the other fibers in the bundle and so there is no cross-talk.

The fibers are bonded together at their ends in a close-packed hexagonal array as
illustrated in Fig. 10.15. Only the light entering the fiber cores is transmitted. A simple
calculation based on Fig. 10.15 shows that in such an array the core packing fraction (i.e. the
area occupied by the cores as a fraction of the total bundle cross-sectional area) is given by
(see Problem 10.6)

$$F = \frac{\pi}{2\sqrt{3}}\frac{d^2}{D^2} \approx \frac{0.9\,d^2}{D^2} \qquad (10.15)$$

where $d$ is the core diameter and $D$ the total fiber diameter. Typically $F \approx 0.7$,
indicating that the light guide has about 0.7 of the total transmission of the individual
fibers constituting it.

FIG. 10.14 Illumination of the inside of a vessel using a single fiber.

FIG. 10.15 Close-packed hexagonal array of fibers (labels: core, cladding).
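
Equation (10.15) is easily evaluated for particular fiber dimensions. In the sketch below the 44 µm core in a 50 µm fiber is a hypothetical example chosen to give a packing fraction close to the typical value quoted; the function names are illustrative.

```python
import math

def core_packing_fraction(core_diameter, fiber_diameter):
    """Core packing fraction of a close-packed hexagonal array, eq. (10.15)."""
    return math.pi / (2 * math.sqrt(3)) * (core_diameter / fiber_diameter) ** 2

def core_to_fiber_ratio(packing_fraction):
    """Ratio d/D needed to reach a given packing fraction."""
    return math.sqrt(packing_fraction * 2 * math.sqrt(3) / math.pi)

if __name__ == "__main__":
    # Hypothetical fiber: 44 um core in a 50 um overall diameter.
    print(f"F for d = 44 um, D = 50 um: {core_packing_fraction(44e-6, 50e-6):.2f}")
    print(f"d/D required for F = 0.7:   {core_to_fiber_ratio(0.7):.2f}")
    print(f"upper limit (no cladding):  {core_packing_fraction(1.0, 1.0):.3f}")
```

Even with a vanishingly thin cladding the packing fraction cannot exceed about 0.91, because of the gaps between circular fibers in the array.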

In addition to simply transferring light between two points, light guides are used for shape
converting and beam splitting. In shape converting, the ends of the bundle are bonded into
different shapes to match different input and output conditions. A typical application is the
collection of light from a spectral source and its guidance to the slit of a spectrometer. 
Similarly, the output end of a light guide could be shaped to illuminate a single line of script 
in a document scanner. 

In beam splitting guides, the total number of fibers is simply divided among several inputs 
or outputs (or both). Such beam splitting guides have been used for reading computer punched 
cards, while another common use is in illuminated signs such as those on motorways. Here 
a fiber bundle illuminated with a single light source is spread out to the appropriate parts of 
a matrix display (Fig. 10.16). Different display patterns can be formed by using different 
fiber bundles, each one being illuminated separately. To ensure that the angular distribution
of the emerging light is uniform irrespective of which bundle is being used, a large diameter
monofiber is situated at each matrix position to act as a 'mixer' (Fig. 10.17). Finally,
a collimating lens reduces the divergence of the emerging beam, since a wide angle of view 
is not required, whereas it is essential to have as high a brightness as possible for the display 
to be visible in daylight conditions. Beam splitting guides are widely used as Y-guides in
which the light traverses the guide in two directions. As shown in Fig. 10.2(a), light travels
along one of the branches to the output end where it is reflected back into the other branch
and guided to a detector. The detector signal then provides information about the conditions
which exist in the vicinity of the common end. This system has been used for reflectance
measurements, counting objects and proximity sensing, as we discussed in section 10.1.1.

FIG. 10.16 Illumination of the legend on a motorway sign using a fiber bundle beam splitter.

FIG. 10.17 Use of a monofiber to ensure an even angular output from a number of beam splitter inputs
for a motorway sign. The beam splitter outputs feed a large diameter monofiber mixer, which is
followed by a collimating lens.

10.2.1 Coherent bundles

Coherent bundles are assemblies of optical fibers in which the fibers are positioned accu- 
rately within the bundle such that spatial information can be transmitted. This is in marked 
contrast to the light guides discussed earlier where there is no correlation between the fiber
positions at the two ends of the bundle. 

In coherent bundles it is most important that there is no cross-talk between the constituent 
fibers, as this would degrade the information being transmitted. Cross-talk is prevented by 
making the cladding of the fibers sufficiently thick that light cannot leak from one fiber into
neighbouring fibers. The output of each fiber will then depend solely on the light entering
its input end face (within its acceptance angle) and the transmittance of the fiber. Hence,
when an image is formed on the end of a coherent bundle the light emerging from each fiber
is simply related to the irradiance distribution in the portion of the light illuminating its input
face. Now, owing to the integrating nature of the individual fibers their outputs are of uniform
irradiance so that no detail finer than the fiber diameter will appear in the transferred image.

The information transfer capacity is controlled by the resolution of the bundle, that is by 
the diameter of the individual fibers used. For this reason, the fibers used are of much smaller
diameter than those used in light guides; diameters down to 5 µm are common. Such fibers
are difficult to handle individually and are often produced as groups where the individual
fibers are fused together into a multiple fiber, typically in a 6 x 6 square array. These multiple
fibers are then stacked together to form a coherent bundle in their own right, containing
up to a million individual fibers in lengths of a few metres with a resolution of about 50
line pairs per millimetre.
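The link between fiber size and resolution can be made concrete with a rough sampling argument. The sketch below assumes (this is not stated in the text) that one resolvable line pair must span at least two fiber centre-to-centre spacings, so it gives only an upper bound on the resolution; the spacings used are illustrative.

```python
def max_line_pairs_per_mm(center_spacing_um):
    """Upper bound on resolution, assuming one line pair needs two fiber spacings."""
    return 1000.0 / (2.0 * center_spacing_um)

# Illustrative centre-to-centre spacings in micrometres (assumed values).
for spacing_um in (5.0, 10.0):
    limit = max_line_pairs_per_mm(spacing_um)
    print(f"spacing {spacing_um:4.1f} um -> at most {limit:.0f} line pairs per mm")
```

On this crude estimate, a centre-to-centre spacing of the order of 10 µm is consistent with the figure of about 50 line pairs per millimetre quoted above.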

Flexible coherent bundles are fabricated by bonding the fibers together only at their extrem- 
ities. Such bundles are quite widely used as fiberscopes for inspection purposes. An object 
lens is used to form a real image on the input face of the fiberscope. This image is then
transferred to an eyepiece. Because the resolution of the fiberscope is limited by the diameter of
the individual fibers a certain amount of defocusing of the image formed by the object lens
may not be detectable by the observer (see Problem 10.7). Applications of fiberscopes range
from the inspection of the inside of pipes and castings to the medical inspection of the human
body; in medicine they are often referred to as endoscopes.

10.2.1.1 Rigid coherent bundles

The individual fibers in a bundle can be fused together by the application of heat and
pressure to form a single mechanical unit. Physically, this behaves as a single piece of glass but,
owing to the presence of the cladding on each fiber, it retains the optical properties of a
coherent fiber bundle. In such bundles, the air gaps are totally removed in the fusing process so
that the core packing fraction is simply the square of the ratio of the core diameter to the
total diameter. This type of bundle is often made from very small diameter fibers (≈ 5 µm),
and because the cladding has to be thick enough to avoid cross-talk the packing fraction may
be as low as 0.5, although packing fractions as high as 0.9 can also be achieved.
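To see what these packing fractions imply for the cladding, the following minimal sketch inverts the fused-bundle relation F = (d/D)² stated above to obtain the cladding wall thickness (D − d)/2. The 5 µm core diameter follows the text; the function names are hypothetical.

```python
import math

def fused_packing_fraction(core_d, fiber_d):
    """Core packing fraction of a fused bundle with no air gaps: (d/D)^2."""
    return (core_d / fiber_d) ** 2

def cladding_thickness(core_d, packing_fraction):
    """Cladding wall thickness (D - d)/2 that yields the given packing fraction."""
    fiber_d = core_d / math.sqrt(packing_fraction)
    return (fiber_d - core_d) / 2.0

CORE_D_UM = 5.0  # core diameter in micrometres, as quoted in the text
for F in (0.5, 0.9):
    t = cladding_thickness(CORE_D_UM, F)
    print(f"F = {F}: cladding thickness about {t:.2f} um")
```

The output illustrates why a thick cladding, chosen to suppress cross-talk, drives the packing fraction down towards 0.5.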

There are basically two categories of rigid coherent bundle. These are the image conduit 
which has a length much greater than its diameter, and the fused plate, which has a length 
less than its diameter. Indeed, plates with diameters up to 200 mm are available. 

During the manufacturing process, the fibers can be distorted around foreign particles 
present in the bundle and also any air not expelled may form bubbles, which again distort 
the fibers. These distortions allow light to escape from the affected fibers thereby reduc- 
ing their transmission relative to their neighbours with a consequent reduction in image 
quality.

Another consequence of the manufacturing process is that, as mentioned above, the bundle 
becomes effectively a single piece of glass so that stray light becomes much more impor- 
tant. In general, the light which falls on the end face of the cladding of a single fiber is not 
accepted by the fiber core but may be transmitted by internal reflections at the cladding/air 
interface. In long, flexible fibers, many such reflections occur so that the losses of stray light
are high and very little of it reaches the output face. In a rigid bundle, and more especially
in fused plates, there are very few such reflections and a large fraction of the stray light may
reach the output face. The effect of this is to increase the level of illumination uniformly
over the transmitted image, thereby reducing the image contrast.

FIG. 10.18 Schematic cross-section of a faceplate allowing photographs to be taken from a CRT display.

The most successful way of eliminating this unwanted light is to incorporate a second 
cladding of absorbing glass around the fiber; this is one form of what is known as extramural
absorption (e.m.a.). The absorbing cladding does not affect the light accepted by the
core but removes only stray light which may enter the inner cladding. Thus, the light reaching
the output face of the bundle is predominantly that which forms the transmitted image.

While image conduits are not widely used, there are many applications for fused plates;
perhaps their major application is as face plates of electro-optic devices where the visual
output is formed by the excitation of a phosphor. Fresnel reflection losses at the plate faces
can be eliminated by the use of antireflection coatings so that up to 90% of the light emitted
by the phosphor may be transmitted. This compares with a figure of less than 25% for lens
systems.

A typical use of the fused plate is as a faceplate for photographing traces on a cathode
ray tube. The plate must be vacuum-tight and capable of withstanding the changes of temperature
which occur as the plate is coated with the phosphor. The trace excited by the electron
beam is transmitted by the fused plate, as illustrated in Fig. 10.18, directly to the
photographic film placed in contact with the outer face of the plate. The image intensifier
described in section 7.3.4 also incorporates fused plates. In this case, the plates have a dual
role, namely light transmission and image shaping.

PROBLEMS 



10.1 Show that the displacement-output characteristic of the fiber displacement sensor
shown in Fig. 10.1(c) is proportional to the factor $[\cos^{-1}\delta - \delta\sin(\cos^{-1}\delta)]/\pi$ where
$\delta = 1 - (d/a)$, a being the radius of the expanded beam and d the displacement measured
from the edge of the expanded beam. Assume a constant beam irradiance across
the expanded beam and a straight-edged shutter.
Sketch this function over the range d = 0 to 2a.

10.2 Calculate the fraction of light transferred from the emitting to the receiving fiber in
the two-fiber version of the Fotonic sensor shown in Fig. 10.2(b) where two fibers of
radius a and separation s are situated a distance z from a reflecting surface. (Hint: as
implied in the text, consider the 'coupling' between the image of the emitting fiber and
the receiving fiber. There are three regions to be considered: (a) $R < s + a$, (b)
$s + 3a > R > s + a$, and (c) $R > s + 3a$, where $R = a + 2z\tan[\sin^{-1}(\mathrm{NA})]$. In the second region,
where the radiation cone partially overlaps the receiving fiber core, assume that the
circular edge of the cone may be replaced by a straight edge; this enables the result of
Problem 10.1 to be used.)

Sketch the coupling efficiency as a function of z for typical values of a and s.

10.3 It is proposed to use the amount of light transmitted between two fibers whose ends
are offset longitudinally (see Fig. 8.34 and Problem 8.17) as an optical 'dipstick'.
Investigate how the amount of light transmitted between the fibers varies with the
refractive index and the absorption coefficient of the intervening medium.

If the sensor is lowered from air to water, by how much would you expect the coupled
radiation to increase?

10.4 Derive the form of the output expected from a Mach-Zehnder interferometer when 
beam irradiances in the two arms are unequal. What advantage is there in this situa- 
tion in using both possible outputs from the interferometer? 

10.5 Use a simple meridional ray model (i.e. Fig. 8.6) to calculate the phase changes expected
in a fiber of length L and radius a for a given mode when there is (a) a length change
of δL and (b) a radius change of δa. Ignore the phase change on reflection. Hence show
that for modes far from cut-off, the same fractional change in L will give rise to a much
greater phase change than will the same fractional change in a.

10.6 Show that with a close-packed hexagonal bundle of fibers the ratio of core area to the
total bundle cross-sectional area is given by $\pi d^2/(2\sqrt{3}D^2)$ where d is the core diameter
of an individual fiber and D its total diameter.

10.7 The light from a point object is focused with unit magnification onto the end of a fiber 
whose core diameter is d. Show that the fiber may be moved backward and forward a 
distance/d/2<3 about the point of focus without altering the total amount of light enter- 
ing the fiber core. When fiber bundles are used for image transmission, this represents 
the extent to which the lens can be defocused and still give the same resolution in the 
transmitted image. 
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Answers to numerical problems 



1.3 $5.3 \times 10^{5}$ V m$^{-1}$
1.6 0.8 mm; 47 mm
1.7 $1.5 \times 10^{-5}$ m; for $\lambda = 600$ nm, 35 and 40; for $\lambda = 500$ nm, 42 and 48
1.8 $6.25 \times 10^{-8}$ m; $12.5 \times 10^{-8}$ m
1.9 Six orders; 409.5 nm; 468 nm; 546 nm; 655.2 nm
1.10 0.77 mm; 15.4 mm
1.12 $4.08 \times 10^{-19}$ J (2.55 eV); 487 nm
1.13 $5.80 \times 10^{14}$ Hz; $2.79 \times 10^{-19}$ J (1.74 eV)

2.3 5xl0-"m 

2.4 1.84 x 10 V 

2.5 $6.02 \times 10^{-18}$ J; $2.41 \times 10^{-17}$ J; $5.42 \times 10^{-17}$ J; $4.82 \times 10^{-17}$ J; $7.27 \times 10^{16}$ Hz
2.7 $1.64 \times 10^{16}$ m$^{-3}$; 224 Ω$^{-1}$ m$^{-1}$; 80 Ω$^{-1}$ m$^{-1}$

2.8 0.008 eV 

2.9 0.127; 0.873 

2.11 8.1xlO"m- 3 

2.12 $3.5 \times 10^{-3}$ m$^2$ s$^{-1}$; $1.25 \times 10^{-3}$ m$^2$ s$^{-1}$; $4.18 \times 10^{-4}$ m; $2.5 \times 10^{-4}$ m
2.15 $1 \times 10^{23}$ m$^{-3}$
2.16 0.9 V; $4.88 \times 10^{-8}$ m; $3.68 \times 10^{-7}$ m
2.17 0.72 µA; 99%
2.18 868 nm; 517.6 nm; 5.52 nm
3.2 Quartz 0.0162 mm; calcite 1.71 µm
3.3 $8.4 \times 10^{-5}$
3.4 KDP 7.53 kV; KD*P 3.03 kV; ADP 9.21 kV; LiNbO$_3$ 0.74 kV
3.5 1400 V
3.7 1.45 W
3.9 0.4°
4.2 260 MHz
4.4 2.6 nits
4.5 $2\pi AB(0)$; $\pi AB(0)$
4.7 0.5
5.1 41 532 K; 69.2 µm
5.2 10.05 m$^{-1}$; 0.366
5.3 1.39 m$^{-1}$; 0.098 m$^{-1}$

5.4 $9.30 \times 10^{-29}$%; $9.57 \times 10^{-17}$%
5.5 $5.6 \times 10^{8}$ m$^{-3}$
5.6 $5.19 \times 10^{7}$ Hz; $1.28 \times 10^{9}$ Hz
5.7 90.5% reflectance
5.8 0.64 m$^{-1}$
5.9 1 580 278; $3 \times 10^{8}$ Hz; five
5.11 6.72 MW m$^{-1}$
5.12 4.22 kW
5.13 0.7
6.1 0.1 m; 0.21 K
6.2 474.6 m s$^{-1}$
6.4 $4 \times 10^{7}$ Hz
6.5 0.316 mm

6-6 4.05 x 10" WnT 2 sf ! ; 6.89 x 10 L1 lm m" 2 sr"' 

6.7 3.33 ns; 667 ps; 13.4 ns; 17.5 fs; 0.44 ps; 2.05 ns 

6.8 5 m; 122.45 m 

6.9 17.59 MeV 

7-1 10-' n W 

7.2 48 µm

7.4 5.5xlO-"W 

7.5 1.7xlO" 6 A V" 1 J" 1 

7.7 $10^{12}$ photons s$^{-1}$
7.10 $7.26 \times 10^{-11}$ W; $2.44 \times 10^{-12}$ W
7.11 6.4 mA; 0.346 V
7.13 y = 0.922; x = 0.427
7.15 57 µm
8.1 48.8°
8.3 $\psi$ = 63.8°; $\theta$ = 64.7°

8.4 0.52%; 1.3x1 0" 2, % 

8.6 2230 dB m$^{-1}$
8.8 9.2°; 1.441; 1263; yes to 6.9°
8.9 30 ns km$^{-1}$
8.10 0.68 µm; 3.6 µm
8.11 150 ps km$^{-1}$
8.14 52.3 dB km$^{-1}$
8.15 0.0294 dB km$^{-1}$, 2.29 µm
8.18 5.5 dB km$^{-1}$
8.19 About 10 MHz
8.23 About $4 \times 10^{-3}$ dB km$^{-1}$

9.1 0.391% (1 part in 256) 

9.2 64 Mbps
9.3 6 km (range limited by absorption)
9.4 14.9 µW
9.7 0.923
9.9 S/N 221 (= 23.4 dB); 318 Ω; S/N 1387 (= 31.4 dB)
9.11 21.5; 10.5 dB
9.12 10 nW
9.15 1.63 mm
9.16 0.86 V
10.3 1.77 times
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Birefringence 



Anisotropy in crystals is caused by the arrangement of atoms being different in different
directions through the crystal. Consequently, the electric polarization (or dipole moment per unit
volume) P produced in a crystal by a given electric field $\mathcal{E}$ depends on the direction of the
field. It follows that the relative permittivity $\varepsilon_r$ and the refractive index, which is given by
$n = (\varepsilon_r)^{1/2}$ (ref. A2.1, Chapter 11), are also direction dependent.

Because electromagnetic waves are transverse, the refractive index for a given direction 
of propagation of light through the crystal depends on the value of the electric vector $\mathcal{E}$, which
is, of course, perpendicular to this direction. In fact, the electric vector oscillates in a plane
perpendicular to the wave velocity direction and thus the velocity of the waves through the
crystal will be dependent on the plane of polarization of the light. This phenomenon, called
birefringence, occurs naturally in some materials, and it may be enhanced by the application
of external forces such as those due to electric fields. It may also be induced in isotropic
materials by the same means. 



A2.1 Natural birefringence

We may appreciate how birefringence arises as follows. Let us consider placing a dielectric
medium into a parallel plate capacitor to which a constant voltage is applied. The electric
field across the medium polarizes it and dipoles appear throughout the material, lined
up in the direction of the field. The resulting polarization can be shown to be related to the
electric field and electric displacement, D, by the equations (ref. A2.1, Chapter 3)

$$D = \varepsilon_0 \mathcal{E} + P$$

$$D = \varepsilon_0 \varepsilon_r \mathcal{E} = \varepsilon \mathcal{E} \tag{A2.1}$$

The ratio $P/\varepsilon_0\mathcal{E}$ is called the electric susceptibility, $\chi$, and we see from eqs (A2.1) that

$$\frac{P}{\varepsilon_0 \mathcal{E}} = (\varepsilon_r - 1) = \chi \tag{A2.2}$$

In isotropic media (and cubic crystals), direction in the medium is not important, the vectors
$\mathcal{E}$, D and P are all parallel, and $\varepsilon_r$, $n$ and $\chi$ are scalar quantities. The speed of propagation
of electromagnetic waves in such media is constant irrespective of the direction of propagation.
This is not the case in birefringent materials. In general, in such materials the polarization
occurring in response to an applied electric field depends on both the magnitude and
direction of the field and, in addition, the induced polarization may be in a different direction
from that of the field. Thus, for example, a field applied in the x direction will induce
not only a polarization $P_x$ parallel to itself, but possibly also polarizations $P_y$ and $P_z$ in the y
and z directions. Thus, from eq. (A2.2) we may write

$$P_x = \varepsilon_0(\chi_{11}\mathcal{E}_x + \chi_{12}\mathcal{E}_y + \chi_{13}\mathcal{E}_z)$$

$$P_y = \varepsilon_0(\chi_{21}\mathcal{E}_x + \chi_{22}\mathcal{E}_y + \chi_{23}\mathcal{E}_z) \tag{A2.3}$$

$$P_z = \varepsilon_0(\chi_{31}\mathcal{E}_x + \chi_{32}\mathcal{E}_y + \chi_{33}\mathcal{E}_z)$$

where the capital letters represent the complex amplitudes of the corresponding time-varying
quantities. The 3 x 3 matrix array $\chi_{ij}$ is called the electric susceptibility tensor. Alternatively,
we can use the electric permittivity tensor $\varepsilon_{ij}$, where $D_i = \varepsilon_{ij}\mathcal{E}_j$, to describe the dielectric response
of the crystal to an applied field. These tensors are also 3 x 3 matrix arrays.

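The magnitude of the $\chi_{ij}$ (and $\varepsilon_{ij}$) coefficients depends on the choice of the x, y and z
coordinate axes relative to the crystal structure. It turns out always to be possible to choose x, y
and z so that the off-diagonal elements vanish, giving

$$P_x = \varepsilon_0\chi_{11}\mathcal{E}_x, \qquad P_y = \varepsilon_0\chi_{22}\mathcal{E}_y, \qquad P_z = \varepsilon_0\chi_{33}\mathcal{E}_z \tag{A2.4}$$

and similarly

$$D_x = \varepsilon_{11}\mathcal{E}_x, \qquad D_y = \varepsilon_{22}\mathcal{E}_y, \qquad D_z = \varepsilon_{33}\mathcal{E}_z \tag{A2.5}$$

where from eqs (A2.1) and (A2.4) we see that $\varepsilon_{11} = \varepsilon_0(1 + \chi_{11})$ etc. These directions are called
the principal axes of the crystal and the corresponding diagonal terms $\varepsilon_{11}$, $\varepsilon_{22}$ and $\varepsilon_{33}$ the
principal permittivities.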
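The role of the principal axes can be illustrated numerically. The sketch below uses a hypothetical symmetric susceptibility tensor (the numbers are invented for illustration only) to show that a field along x generally induces a polarization that is not parallel to it, and that diagonalizing the tensor recovers the principal axes and the principal susceptibilities, from which $n = (1 + \chi)^{1/2}$ follows.

```python
import numpy as np

EPS0 = 8.854e-12  # permittivity of free space, F/m

# Hypothetical symmetric susceptibility tensor expressed in non-principal axes.
chi = np.array([[1.5, 0.2, 0.0],
                [0.2, 1.3, 0.0],
                [0.0, 0.0, 1.2]])

E = np.array([1.0, 0.0, 0.0])   # field applied along x (V/m)
P = EPS0 * chi @ E              # polarization from the tensor relation (A2.3)
print("P / eps0 =", P / EPS0)   # non-zero y component: P is not parallel to E

# For a real symmetric tensor the eigenvectors are the principal axes and
# the eigenvalues are the principal susceptibilities.
chi_principal, axes = np.linalg.eigh(chi)
n_principal = np.sqrt(1.0 + chi_principal)
print("principal susceptibilities:", chi_principal)
print("principal refractive indices:", n_principal)
```

In the rotated frame defined by the columns of `axes`, the tensor is diagonal, which is the numerical counterpart of eqs (A2.4) and (A2.5).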

An electric field parallel to a principal axis will only produce a parallel electric polarization,
but in general the three permittivities are different. This variation in permittivity and the
corresponding variation in refractive index and hence in wave velocity is the origin of
birefringence. In general, anisotropic crystals are thus characterized by the three principal
refractive indices $n_x$, $n_y$ and $n_z$, where the subscripts relate to the direction of polarization
of the electromagnetic waves and not to their direction of propagation, and where
$n_x = (\varepsilon_{11}/\varepsilon_0)^{1/2} = (1 + \chi_{11})^{1/2}$ etc.

Thus consider, for example, a wave propagating in the z direction. If its electric field, that
is its plane of polarization, is parallel to the x direction, it will induce only polarization $P_x$
according to eqs (A2.4) and (A2.5) and will experience the permittivity $\varepsilon_{11}$, that is the
refractive index $n_x$. If, on the other hand, the wave has its plane of polarization in the y direction,
it will experience the refractive index $n_y$. Any unpolarized wave propagating in the z direction
can be resolved into two components with polarizations parallel to the x and y directions.
These two components will thus travel through the crystal with different velocities,
and hence get progressively out of phase, so that the resultant at some point z will, in general,
represent elliptically polarized light. The exact nature of the light depends on the phase
difference and the relative amplitudes of the two components.

Anisotropic crystals are divided into two classes. The first class, which includes calcite
(CaCO$_3$), are characterized by a principal axis (the z axis by convention) along which the
permittivity is $\varepsilon_{33}$, but perpendicular to which the permittivity is independent of direction
(i.e. $\varepsilon_{11} = \varepsilon_{22}$). Thus, such crystals only have two principal refractive indices. The z direction
is unique in that the velocity of propagation is independent of the wave polarization; it
is termed the optic axis. Such materials are said to be uniaxial. The difference between
$n_x$ ($= n_y$) and $n_z$ is a measure of the birefringence and is said to be positive or negative depending
on whether $n_z$ is greater or smaller than $n_x$. Thus calcite is a negative uniaxial crystal,
while quartz, for example, is a positive uniaxial crystal.

Crystals of lower symmetry possess two optic axes and are said to be biaxial. These axes
do not coincide with any of the principal axes and consequently $n_x$, $n_y$ and $n_z$ are all different.

If Maxwell's electromagnetic theory is applied to an isotropic medium, it can be shown 
that light of any polarization can propagate in any direction. In the case of anisotropic materials,
however, this is not the case and it turns out (ref. A2.3) that as a result of birefringence
only two states of polarization can propagate for any crystal direction. These planes
are perpendicular to each other and hence, except when propagation is parallel to an optic 
axis, the two components experience different refractive indices and travel at different 
speeds. 

The values of refractive index and the states of polarization for a particular direction of 
propagation can be determined from a three-dimensional representation of the refractive index 
known as the index ellipsoid (or optical indicatrix) (ref. A2.2). The form of this can be
obtained as follows: the energy density W in a dielectric is given by

$$W = \tfrac{1}{2}\,D \cdot \mathcal{E} \tag{A2.6}$$

Thus using eq. (A2.5), we have

$$W = \frac{1}{2}\left(\frac{D_x^2}{\varepsilon_{11}} + \frac{D_y^2}{\varepsilon_{22}} + \frac{D_z^2}{\varepsilon_{33}}\right)$$

A constant energy surface then takes the form of an ellipsoid in a three-dimensional space
where the dielectric displacement is the coordinate.

Dividing both sides by W and writing $D_x^2/2\varepsilon_0 W$ as $x^2$ etc. leads to the expression

$$\frac{x^2}{n_x^2} + \frac{y^2}{n_y^2} + \frac{z^2}{n_z^2} = 1 \tag{A2.7}$$

where $n_x^2 = \varepsilon_{11}/\varepsilon_0$ etc.

This equation represents an ellipsoid with semiaxes $n_x$, $n_y$ and $n_z$. For a uniaxial crystal
$n_x = n_y \neq n_z$ and the index ellipsoid, which has circular symmetry about the z axis, is as shown
in Fig. A2.1.

It is usual to refer to $n_x$ ($= n_y$) as the ordinary refractive index $n_o$ and to $n_z$ as the
extraordinary refractive index $n_e$, in which case we may write

$$\frac{x^2}{n_o^2} + \frac{y^2}{n_o^2} + \frac{z^2}{n_e^2} = 1 \tag{A2.7a}$$

FIG. A2.1 Refractive index ellipsoid or optical indicatrix for a uniaxial crystal. Waves with polarizations
parallel to the z axis experience the refractive index $n_e$, while those polarized parallel to the x and y axes
experience the refractive index $n_o$. For propagation of a wave in the general direction r, there are two
allowed directions of polarization: parallel to OP, with index $n_o$, and parallel to OQ, with index $n_e(\theta)$.

Let us consider light propagating in a direction r, at an angle $\theta$ to the optic (z) axis (Fig.
A2.1). Because of the circular symmetry mentioned above, we can choose, without loss of
generality, that the y axis should coincide with the projection of r on the xy plane. The plane
normal to r intersects the ellipsoid in the shaded ellipse shown. The two allowed directions
of polarization are parallel to the axes of this ellipse and thus correspond to OP and OQ;
they are thus perpendicular to r as well as to each other. The two waves polarized along these
directions have refractive indices given by OP = $n_o$ and OQ = $n_e(\theta)$ (the extraordinary wave).
In the case of the extraordinary wave, the plane of polarization varies with $\theta$ as does the
refractive index. We can determine the relationship between $n_e(\theta)$ and $n_e$, $n_o$ with the aid of Fig.
A2.2, which shows the intersection of the index ellipsoid with the yz plane. From the diagram,
we see that

$$n_e^2(\theta) = z^2 + y^2$$

and

$$z = n_e(\theta)\sin\theta$$

Thus, substituting these equations into the equation for the ellipse shown, namely

$$\frac{y^2}{n_o^2} + \frac{z^2}{n_e^2} = 1$$

gives

$$\frac{1}{n_e^2(\theta)} = \frac{\cos^2\theta}{n_o^2} + \frac{\sin^2\theta}{n_e^2}$$

FIG. A2.2 Intersection of the index ellipsoid with the yz plane. OQ = $n_e(\theta)$ represents the refractive index
of the extraordinary wave propagating in the r direction.

Thus for $\theta = 0°$, that is propagation along the optic axis, $n_e(0°) = n_o$, while for $\theta = 90°$,
$n_e(90°) = n_e$. The two polarizations which can be propagated correspond to the maximum
and minimum refractive indices given by the index ellipsoid (in the case of positive crystals
$n_e > n_o$, while in the case of negative crystals $n_e < n_o$). For propagation parallel to the
optic axis (i.e. the z direction), there is no birefringence as the section of the ellipsoid
perpendicular to this direction is a circle. For propagation perpendicular to the optic axis, for
example the x direction, the birefringence will be a maximum, the permitted polarizations
will be parallel to the y axis with refractive index $n_o$ and parallel to the z axis with refractive
index $n_e$.
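The angular dependence of the extraordinary index, $1/n_e^2(\theta) = \cos^2\theta/n_o^2 + \sin^2\theta/n_e^2$, is easy to evaluate numerically. The following minimal sketch does so for a negative uniaxial crystal; the indices used are commonly quoted values for calcite near 589 nm and are an assumption here, not figures taken from this appendix.

```python
import math

def n_extraordinary(theta_deg, n_o, n_e):
    """Extraordinary index for propagation at angle theta to the optic axis."""
    theta = math.radians(theta_deg)
    inv_n_sq = math.cos(theta) ** 2 / n_o ** 2 + math.sin(theta) ** 2 / n_e ** 2
    return 1.0 / math.sqrt(inv_n_sq)

# Assumed values for calcite (negative uniaxial: n_e < n_o).
N_O = 1.658
N_E = 1.486

for theta_deg in (0, 30, 60, 90):
    n_theta = n_extraordinary(theta_deg, N_O, N_E)
    print(f"theta = {theta_deg:2d} deg: n_e(theta) = {n_theta:.4f}, "
          f"birefringence = {n_theta - N_O:+.4f}")
```

The birefringence vanishes along the optic axis and reaches its maximum magnitude for propagation perpendicular to it, as described above.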



A2.2 Induced birefringence

When an electric field is applied to an anisotropic crystal, new principal axes may be introduced
(e.g. in KDP) while birefringence may be induced in naturally isotropic materials such
as GaAs. The electric field changes the shape of the refractive index ellipsoid as a result of
changing the permittivity tensor $\varepsilon_{ij}$. After deformation, the axes of the ellipsoid are no longer
the original principal axes and the ellipsoid must be represented by an equation of the form

$$\left(\frac{1}{n^2}\right)_1 x^2 + \left(\frac{1}{n^2}\right)_2 y^2 + \left(\frac{1}{n^2}\right)_3 z^2 + 2\left(\frac{1}{n^2}\right)_4 yz + 2\left(\frac{1}{n^2}\right)_5 xz + 2\left(\frac{1}{n^2}\right)_6 xy = 1 \tag{A2.8}$$

The applied electric field causes a change in the ordinary and extraordinary refractive
indices. This change can be written as

$$\Delta\left(\frac{1}{n^2}\right)_i = \sum_{j} r_{ij}\mathcal{E}_j \tag{A2.9}$$

where $i = 1, 2, \ldots, 6$, $j = 1, 2, 3$, and $r_{ij}$ is the electro-optic tensor with 6 x 3 elements. For most
materials, however, the symmetry of the crystal reduces the 18 elements of this tensor
significantly. We have indicated the values of some of these electro-optic coefficients in Table
3.1 for a representative selection of optical materials. We see that the Pockels effect discussed
in Chapter 3 will thus be different in the various crystals. In general, crystal class and symmetry
(ref. A2.3) are important and a universal response cannot be predicted. Let us consider
as an example crystals similar to KDP. In this class of materials, the only non-zero
elements in the electro-optic tensor are $r_{41}$, $r_{52}$ ($= r_{41}$) and $r_{63}$. In an arbitrarily directed field,
the index ellipsoid becomes

$$\frac{x^2}{n_o^2} + \frac{y^2}{n_o^2} + \frac{z^2}{n_e^2} + 2r_{41}\mathcal{E}_x yz + 2r_{41}\mathcal{E}_y xz + 2r_{63}\mathcal{E}_z xy = 1 \tag{A2.10}$$

We note that the first three terms are field independent and hence involve $n_o$ and $n_e$, and
that the field induces cross-terms yz etc. into the equation for the index ellipsoid so that the
major axes of the ellipsoid are no longer parallel to the x, y and z axes as mentioned above.
We must now therefore find the magnitudes and directions of the new axes, in the presence
of the field, so that we may determine its effect on the propagation of electromagnetic waves.
It is usual to restrict the field to the z direction (the original optical axis) so that $\mathcal{E}_x = \mathcal{E}_y = 0$
and eq. (A2.10) becomes

$$\frac{x^2}{n_o^2} + \frac{y^2}{n_o^2} + \frac{z^2}{n_e^2} + 2r_{63}\mathcal{E}_z xy = 1 \tag{A2.11}$$

The problem now is to find a new coordinate system $x'$, $y'$, $z'$ in which eq. (A2.11) contains
no cross-term and takes up the form of eq. (A2.7), that is

$$\frac{x'^2}{n_{x'}^2} + \frac{y'^2}{n_{y'}^2} + \frac{z'^2}{n_{z'}^2} = 1 \tag{A2.12}$$

The lengths of the major axes of the index ellipsoid are thus $2n_{x'}$, $2n_{y'}$, $2n_{z'}$ and will, in
general, depend on the applied field. By inspection of eqs (A2.11) and (A2.12), the z factor
is unchanged if $z'$ is parallel to z and hence $n_{z'} = n_e$, and also eq. (A2.11) is symmetric in x
and y so the new axes $x'$, $y'$ bear similar relations to the former axes x and y; they are in fact
rotated through 45° as shown in Fig. A2.3. The relationships between the two sets of axes
are then

$$x = x'\cos 45° + y'\sin 45°$$

$$y = -x'\sin 45° + y'\cos 45° \tag{A2.13}$$
Substituting eqs (A2.13) into eq. (A2.11) then gives

$$x'^2\left(\frac{1}{n_o^2} - r_{63}\mathcal{E}_z\right) + y'^2\left(\frac{1}{n_o^2} + r_{63}\mathcal{E}_z\right) + \frac{z'^2}{n_e^2} = 1 \tag{A2.14}$$

Equation (A2.14) shows that $x'$, $y'$, $z'$ are the new principal axes and that the field has rotated
the ellipse and changed the lengths of the major axes. From eqs (A2.12) and (A2.14) the
length of the major axis in the $x'$ direction, $2n_{x'}$, is given by

$$\frac{1}{n_{x'}^2} = \frac{1}{n_o^2} - r_{63}\mathcal{E}_z$$

FIG. A2.3 Rotation of the x and y axes into the $x'$ and $y'$ axes, when crystals such as KDP are subjected to
an electric field along the z axis, in which case z and $z'$ are parallel.

Hence the change in refractive index may be evaluated by noting that

$$\frac{1}{n_{x'}^2} - \frac{1}{n_o^2} = \frac{(n_o - n_{x'})(n_o + n_{x'})}{n_{x'}^2 n_o^2} \approx \frac{2(n_o - n_{x'})}{n_o^3}$$

assuming that the change is small so that $n_{x'} \approx n_o$. Therefore

$$n_{x'} = n_o + \frac{n_o^3}{2} r_{63}\mathcal{E}_z \tag{A2.15}$$

and similarly

$$n_{y'} = n_o - \frac{n_o^3}{2} r_{63}\mathcal{E}_z \tag{A2.16}$$

and, as mentioned above, $n_{z'} = n_e$.

Similar relationships can be derived for other classes of crystal and in general $n = n_o \pm \tfrac{1}{2}n_o^3 r\mathcal{E}$,
but the matrix element r varies according to the crystal class (see Table 3.1).
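The small-change approximation used to reach eqs (A2.15) and (A2.16) can be checked against the exact relation $1/n_{x'}^2 = 1/n_o^2 - r_{63}\mathcal{E}_z$ of eq. (A2.14). The sketch below does this for KDP-like parameters; the values of $n_o$, $r_{63}$ and the field are assumed, order-of-magnitude figures chosen for illustration and should be replaced by the entries of Table 3.1.

```python
import math

def principal_indices_exact(n_o, r63, e_z):
    """Exact n_x', n_y' from eq. (A2.14): 1/n^2 = 1/n_o^2 -/+ r63 * E_z."""
    n_x = 1.0 / math.sqrt(1.0 / n_o ** 2 - r63 * e_z)
    n_y = 1.0 / math.sqrt(1.0 / n_o ** 2 + r63 * e_z)
    return n_x, n_y

def principal_indices_approx(n_o, r63, e_z):
    """First-order result of eqs (A2.15) and (A2.16)."""
    delta_n = 0.5 * n_o ** 3 * r63 * e_z
    return n_o + delta_n, n_o - delta_n

# Assumed illustrative values (not taken from the text).
N_O = 1.51       # ordinary refractive index
R63 = 10.6e-12   # electro-optic coefficient, m/V
E_Z = 1.0e6      # applied field along z, V/m

exact = principal_indices_exact(N_O, R63, E_Z)
approx = principal_indices_approx(N_O, R63, E_Z)
print("exact : n_x' = %.8f, n_y' = %.8f" % exact)
print("approx: n_x' = %.8f, n_y' = %.8f" % approx)
print("induced birefringence n_x' - n_y' = %.3e" % (approx[0] - approx[1]))
```

For fields of this size the induced index change is only of order $10^{-5}$, so the first-order expressions agree with the exact ones to well within the precision normally needed.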
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Limitations on LED frequency response 

due to carrier diffusion and recombination



We consider a relatively low level injection of electrons into a semi-infinite piece of p-type
material with unit cross-sectional area. The steady state diffusion equation (eq. 2.41) for the
excess minority carrier density at a point x, $\Delta n(x)$, may be written

$$\frac{\partial}{\partial t}\Delta n(x) = -\frac{\Delta n(x)}{\tau_e} + D_e\frac{\partial^2}{\partial x^2}\Delta n(x) \tag{A3.1}$$



We suppose that the excess population has a time-varying component and we write

$$\Delta n(x) = \Delta n^0(x) + \Delta n^1(x)\exp(2\pi \mathrm{i} f t) \tag{A3.2}$$

Substituting for $\Delta n(x)$ in eq. (A3.1) and separating out the time-dependent and time-independent
parts, we obtain

$$\frac{\mathrm{d}^2}{\mathrm{d}x^2}\Delta n^0(x) - \frac{\Delta n^0(x)}{D_e\tau_e} = 0 \tag{A3.3}$$

and

$$\frac{\partial^2}{\partial x^2}\Delta n^1(x) - \frac{\Delta n^1(x)}{D_e\tau_e}(1 + 2\pi \mathrm{i} f\tau_e) = 0 \tag{A3.4}$$

Both of these equations have general solutions of the form

$$\Delta n(x) = A\exp(-x/L_e) + B\exp(x/L_e) \tag{A3.5}$$

where A and B are constants and $L_e = (D_e\tau_e)^{1/2}$. At $x = \infty$ we must have $\Delta n^0(\infty) = \Delta n^1(\infty) = 0$;
hence B = 0. The solutions to eqs (A3.3) and (A3.4) can therefore be written

$$\Delta n^0(x) = \Delta n^0(0)\exp(-x/L_e) \tag{A3.6}$$

and

$$\Delta n^1(x) = \Delta n^1(0)\exp(-x/L_e^*) \tag{A3.7}$$

where

$$(L_e^*)^2 = \frac{D_e\tau_e}{1 + 2\pi \mathrm{i} f\tau_e} \tag{A3.8}$$
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We may define a frequency-dependent relative quantum efficiency Q(f) by 
$$Q(f) = P(f)/F(f) \tag{A3.9}$$

Here P(f) and F(f) are the frequency-dependent parts of the photon generation rate within
the material and the electron flow rate into the material respectively.
From eq. (A3.2),

$$F(f) = -D_e\left[\frac{\mathrm{d}}{\mathrm{d}x}\Delta n^1(x)\right]_{x=0} = \frac{D_e\,\Delta n^1(0)}{L_e^*} \tag{A3.10}$$

The recombination rate in a region of width $\delta x$ is given by $\Delta n(x)\,\delta x/\tau_e$
(see eq. 2.38), and hence the total recombination rate throughout the material is given by

$$\frac{1}{\tau_e}\int_0^\infty \Delta n(x)\,dx$$

If we assume that each recombination gives rise to a photon then

$$P(f) = \frac{1}{\tau_e}\int_0^\infty \Delta n^1(x)\,dx$$

Hence, by substituting for $\Delta n^1(x)$ from eq. (A3.7),

$$P(f) = \frac{1}{\tau_e}\int_0^\infty \Delta n^1(0)\exp(-x/L_e^*)\,dx = \frac{\Delta n^1(0)\,L_e^*}{\tau_e} \tag{A3.11}$$

From eqs (A3.9), (A3.10) and (A3.11) we then have

$$Q(f) = \frac{(L_e^*)^2\,\Delta n^1(0)}{\tau_e D_e\,\Delta n^1(0)} = \frac{1}{1 + 2\pi i f\tau_e}$$

The modulus of this expression is $1/(1 + 4\pi^2 f^2\tau_e^2)^{1/2}$ and hence we see that the
light output $R(f)$ resulting when the current is modulated at a frequency $f$ can be written

$$R(f) = \frac{R(0)}{(1 + 4\pi^2 f^2\tau_e^2)^{1/2}}$$
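
The frequency response derived above can be evaluated numerically. The following is a minimal Python sketch, not part of the original text: the function names and the 1 ns carrier lifetime are illustrative assumptions. It evaluates the complex efficiency $Q(f)$, its modulus $R(f)/R(0)$, and the frequency $1/(2\pi\tau_e)$ at which the light output falls to $1/\sqrt{2}$ of its low-frequency value.

```python
import math

def quantum_efficiency(f_hz, tau_e):
    """Complex relative quantum efficiency Q(f) = 1 / (1 + 2*pi*i*f*tau_e)."""
    return 1.0 / complex(1.0, 2.0 * math.pi * f_hz * tau_e)

def relative_response(f_hz, tau_e):
    """Modulus R(f)/R(0) = 1 / sqrt(1 + 4*pi^2*f^2*tau_e^2)."""
    return 1.0 / math.sqrt(1.0 + (2.0 * math.pi * f_hz * tau_e) ** 2)

def corner_frequency(tau_e):
    """Frequency at which R(f) has fallen to R(0)/sqrt(2)."""
    return 1.0 / (2.0 * math.pi * tau_e)

tau_e = 1e-9  # assumed minority carrier lifetime of 1 ns (illustrative only)
f_c = corner_frequency(tau_e)
print(f"corner frequency: {f_c / 1e6:.1f} MHz")
print(f"R(f_c)/R(0) = {relative_response(f_c, tau_e):.4f}")
```

With the assumed lifetime the corner frequency is about 159 MHz; a shorter lifetime raises it in inverse proportion.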
Interactions between radiation and electronic energy levels with finite frequency linewidths



In section 5.2 we derived a relationship between the Einstein coefficients (eq. 5.1) by 
analyzing the interaction between a system with only two electronic energy levels and 
blackbody radiation. It was assumed that the energy levels were infinitesimally narrow
and that only radiation at a 'single' frequency was involved. In the first part of this 
appendix we demonstrate that eq. (5.1) remains valid when the energy levels have a finite 
width. 

We start by considering those transitions within the atomic system which give rise to emission
frequencies between $\nu$ and $\nu + d\nu$ (Fig. A4.1). Suppose that the number of atoms per
unit volume in the lower energy state which can absorb radiation with frequencies between
$\nu$ and $\nu + d\nu$ is written $dN_1(\nu)$ whilst the number in the upper state which can give
rise to stimulated emission over the same frequency range is written $dN_2(\nu)$. The number of
absorption and stimulated emission transitions will then be proportional to the density of
radiation which lies between $\nu$ and $\nu + d\nu$, that is to $\rho(\nu, T)\,d\nu$ (eq. 1.42).
Thus we may write the number of upward transitions within this frequency range and per unit
volume as $dN_1(\nu)\,\rho(\nu, T)\,d\nu\,B_{12}$. Similarly the corresponding number of stimulated
emission transitions can be written $dN_2(\nu)\,\rho(\nu, T)\,d\nu\,B_{21}$. Spontaneous emission
transitions will also contribute to the total number of downward transitions. However, we must
remember that all the atoms in the excited state (i.e. $N_2$) can potentially cause photon
emission between $\nu$ and $\nu + d\nu$ so that the probability that a given atom will do so will
in fact be proportional to the factor $g(\nu)\,d\nu$, where $g(\nu)$ is the lineshape function.
Thus we can write the spontaneous emission contribution as $N_2\,g(\nu)\,d\nu\,A_{21}$.

Since the system is in equilibrium the upward and downward rates involving photon frequencies
between $\nu$ and $\nu + d\nu$ must be equal and hence we have

$$dN_1(\nu)\,\rho(\nu, T)\,d\nu\,B_{12} = dN_2(\nu)\,\rho(\nu, T)\,d\nu\,B_{21} + N_2\,g(\nu)\,d\nu\,A_{21} \tag{A4.1}$$

We now integrate both sides over the width of the emission line. If we make the very reasonable
assumption that neither the blackbody radiation density function nor the Einstein
coefficients will vary appreciably over the narrow frequency range of the emission line the
result is

$$\rho(\nu_0, T)\,B_{12}\int_0^\infty dN_1(\nu)\,d\nu = \rho(\nu_0, T)\,B_{21}\int_0^\infty dN_2(\nu)\,d\nu + A_{21}N_2\int_0^\infty g(\nu)\,d\nu \tag{A4.2}$$

where $\nu_0$ is the frequency corresponding to the centre of the emission line. Using the
relationships

$$\int_0^\infty dN_1(\nu)\,d\nu = N_1, \qquad \int_0^\infty dN_2(\nu)\,d\nu = N_2 \qquad \text{and} \qquad \int_0^\infty g(\nu)\,d\nu = 1$$

we now obtain

$$\rho(\nu_0, T)\,B_{12}N_1 = \rho(\nu_0, T)\,B_{21}N_2 + A_{21}N_2$$

This expression will be valid whatever the value of $\nu_0$, and so we may replace it by a general
frequency $\nu$; for notational convenience we also replace $\rho(\nu, T)$ by $\rho_\nu$. Thus we have

$$N_1\rho_\nu B_{12} = N_2\rho_\nu B_{21} + N_2 A_{21} \tag{A4.3}$$

which is identical to eq. (5.1).

FIG. A4.1 An atomic transition between two energy levels with a lineshape function $g(\nu)$ is
bathed in blackbody radiation which has a radiation density of $\rho(\nu, T)$.

When dealing with the absorption of radiation caused by an atomic transition within an 
atom in section 5.3 it was assumed both that the radiation concerned was perfectly mono- 
chromatic and that the atomic energy levels were infinitesimally narrow. Here we investi- 
gate the consequences of allowing both the beam and the atomic transition to have finite 
frequency widths, although, in contrast to the situation considered in the first part of this 
appendix, we assume that the beam frequency bandwidth is much narrower than the transition
linewidth (Fig. A4.2).

The function $g(\nu)\,d\nu$ gives the probability that a particular atom is capable of absorbing
or emitting over the frequency range $\nu$ to $\nu + d\nu$, so the number of atoms per unit volume
which are capable of absorbing radiation between these frequencies can be written as
$N_1 g(\nu)\,d\nu$. In the presence of a radiation density of $\rho(\nu)$ the number of upward
transitions per second per unit volume between these frequency limits can then be written as
$N_1 g(\nu)\,d\nu\,B_{12}\,\rho(\nu)$. We now integrate over the frequency range of the incoming
beam to obtain the total upward transition rate, $R_{12}$, due to absorption from the beam:

$$R_{12} = \int N_1 B_{12}\,g(\nu)\,\rho(\nu)\,d\nu \tag{A4.4}$$

FIG. A4.2 A beam of radiation with a relatively narrow linewidth centred on $\nu_B$ passes through
a collection of atoms where there is an atomic transition having a lineshape function $g(\nu)$
centred on $\nu_0$.

Since the beam linewidth is assumed to be small compared with the atomic transition
linewidth, the function $g(\nu)$ can be brought out of the integral and replaced by $g(\nu_B)$,
where $\nu_B$ is the centre frequency of the beam. Thus we have

$$R_{12} = N_1 B_{12}\,g(\nu_B)\int \rho(\nu)\,d\nu \tag{A4.5}$$



The integral on the right-hand side of eq. (A4.5) is simply the energy density of the entire
beam, which we write as $\rho_B$, so that we have $R_{12} = N_1 B_{12}\,g(\nu_B)\,\rho_B$. Similarly
we may write the downward transition rate due to stimulated emission as
$R_{21} = N_2 B_{21}\,g(\nu_B)\,\rho_B$. Thus the net rate of loss of photons per unit volume,
$-d\mathcal{N}/dt$, from the beam as it travels through a volume element of medium of thickness
$\Delta x$ and unit cross-sectional area (Fig. 5.3) can be written

$$-\frac{d\mathcal{N}}{dt} = N_1\rho_B B_{12}\,g(\nu_B) - N_2\rho_B B_{21}\,g(\nu_B)$$

or, substituting for $B_{12}$ and $B_{21}$ from eq. (5.7),

$$-\frac{d\mathcal{N}}{dt} = \left(\frac{g_2}{g_1}N_1 - N_2\right)\rho_B B_{21}\,g(\nu_B) \tag{A4.6}$$



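Since the beam can have any frequency, it is convenient simply to refer to it as having a
frequency $\nu$, so that we put $\rho_B = \rho_\nu$ and $g(\nu_B) = g(\nu)$, to give

$$-\frac{d\mathcal{N}}{dt} = \left(\frac{g_2}{g_1}N_1 - N_2\right)\rho_\nu B_{21}\,g(\nu) \tag{A4.7}$$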



Thus we obtain an expression similar to eq. (5.11) except that the right-hand side is multiplied
by the function $g(\nu)$. It follows that eq. (5.13) should similarly be modified to

$$\alpha(\nu) = \left(\frac{g_2}{g_1}N_1 - N_2\right)\frac{B_{21}\,h\nu\,n\,g(\nu)}{c} \tag{A4.8}$$

Under conditions of population inversion the small signal gain coefficient can then be written
as

$$k(\nu) = \left(N_2 - \frac{g_2}{g_1}N_1\right)\frac{B_{21}\,h\nu\,n\,g(\nu)}{c} \tag{A4.9}$$
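
The step from eq. (A4.4) to eq. (A4.5) relies on the beam being much narrower than the transition. The Python sketch below is an illustration under stated assumptions, not the authors' calculation: it assumes a Lorentzian lineshape and a Gaussian beam spectrum (neither is specified in the text), works in arbitrary units with frequencies measured as offsets from the line centre, and compares the full integral with the narrow-beam product $N_1 B_{12}\,g(\nu_B)\,\rho_B$.

```python
import math

def lorentzian(offset, fwhm):
    """Normalised Lorentzian lineshape g, as a function of the offset nu - nu_0."""
    half = fwhm / 2.0
    return (half / math.pi) / (offset ** 2 + half ** 2)

def beam_density(offset, centre, sigma, rho_total):
    """Assumed Gaussian beam spectrum rho(nu); it integrates to rho_total."""
    z = (offset - centre) / sigma
    return rho_total * math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

def upward_rate_integral(n1, b12, fwhm, centre, sigma, rho_total, steps=20001):
    """Eq. (A4.4) by trapezoidal integration over +/- 8 sigma about the beam centre."""
    lo = centre - 8.0 * sigma
    h = 16.0 * sigma / (steps - 1)
    total = 0.0
    for i in range(steps):
        x = lo + i * h
        weight = 0.5 if i in (0, steps - 1) else 1.0
        total += weight * lorentzian(x, fwhm) * beam_density(x, centre, sigma, rho_total)
    return n1 * b12 * total * h

def upward_rate_narrow_beam(n1, b12, fwhm, centre, rho_total):
    """Narrow-beam result following eq. (A4.5): N1 * B12 * g(nu_B) * rho_B."""
    return n1 * b12 * lorentzian(centre, fwhm) * rho_total

# Illustrative values (all assumed): 1 GHz transition width, 1 MHz beam width,
# beam centred 300 MHz from the line centre, unit N1, B12 and rho_B.
exact = upward_rate_integral(1.0, 1.0, 1e9, 3e8, 1e6, 1.0)
approx = upward_rate_narrow_beam(1.0, 1.0, 1e9, 3e8, 1.0)
print(exact, approx, abs(exact / approx - 1.0))
```

For these values the two rates agree to better than one part in $10^4$; widening the beam towards the transition linewidth makes the approximation progressively worse.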
Optical bandwidths and pulse broadening



Let us suppose that at the input end of some optical system we have a signal where the optical
power is modulated at a frequency $f$. That is, $P_{\mathrm{in}} = P^0_{\mathrm{in}}[1 - \cos(2\pi f t)]$.
If we write the optical power output of the system as
$P_{\mathrm{out}} = P^0_{\mathrm{out}}[1 - \cos(2\pi f t + \phi)]$, then the ratio of
$P^0_{\mathrm{out}}$ to $P^0_{\mathrm{in}}$ gives the system response, which we denote by $H(f)$.
We define the optical bandwidth of the system as the frequency range $\Delta f_{\mathrm{opt}}$ over
which $|H(f)|$ exceeds one-half of its maximum value. If we allowed the optical output signal to
fall onto a 'perfect' photodetector, then the current output, $i_d$, would follow the variations
in $H(f)$. Electrical power from the detector, being proportional to $i_d^2$, will follow
$|H(f)|^2$. In electrical systems the electrical bandwidth is defined as the frequency range over
which the electrical power exceeds one-half of its maximum value. We can therefore define an
electrical bandwidth for our optical system, $\Delta f_{\mathrm{el}}$, as the frequency range over
which $|H(f)|$ exceeds $1/\sqrt{2}$ of its maximum value. Similar arguments apply when a voltage
output is obtained.

We turn now to pulse broadening. Suppose we launch an optical pulse which, as far as
the system response is concerned, is infinitesimally narrow. When the pulse emerges from
the system, it will have broadened into a definite shape. This shape may be described by
the normalized impulse response $h(t)$. The function $h(t)$ is normalized so that

$$\int_{-\infty}^{\infty} h(t)\,dt = 1$$

The functions $h(t)$ and $H(f)$ are in fact related by a Fourier transform (ref. A5.1), that is

$$H(f) = \int_{-\infty}^{\infty} h(t)\exp(-i2\pi f t)\,dt \tag{A5.1}$$

A useful parameter for characterizing the spread of a pulse is the r.m.s. pulse width. This is
given by $\sigma$ where

$$\sigma^2 = \int_{-\infty}^{\infty} t^2 h(t)\,dt - \left[\int_{-\infty}^{\infty} t\,h(t)\,dt\right]^2 \tag{A5.2}$$

If $h(t)$ is symmetrical about a time $t_0$, then by measuring time from $t_0$ we ensure that the
second term on the right-hand side of eq. (A5.2) is zero.

$\sigma$ is a useful quantity because it may be shown (ref. 8.21) that any pulse having an r.m.s.
pulse width $\sigma_1$ that passes through a linear system whose impulse response has an r.m.s.
width $\sigma_2$ will emerge with an r.m.s. pulse width $\sigma_3$ where

$$\sigma_3^2 = \sigma_1^2 + \sigma_2^2 \tag{A5.3}$$
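
Eq. (A5.3) can be checked numerically by convolving a sampled input pulse with a sampled impulse response. The following is a minimal Python sketch, assuming a rectangular input pulse and a Gaussian impulse response; the pulse shapes, the sample spacing and the helper names `rms_width` and `convolve` are illustrative choices rather than anything taken from the text.

```python
import math

def rms_width(samples, dt):
    """R.m.s. width of a sampled pulse, following eq. (A5.2)."""
    area = sum(samples)
    mean = sum(k * dt * s for k, s in enumerate(samples)) / area
    mean_sq = sum((k * dt) ** 2 * s for k, s in enumerate(samples)) / area
    return math.sqrt(mean_sq - mean ** 2)

def convolve(a, b):
    """Direct discrete convolution of two sampled pulses."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out

dt = 0.01  # sample spacing in ns (assumed)
input_pulse = [1.0] * 200  # rectangular input pulse, 2 ns wide (assumed)
# Gaussian impulse response with r.m.s. width 0.5 ns, centred at 3 ns (assumed)
response = [math.exp(-0.5 * ((k * dt - 3.0) / 0.5) ** 2) for k in range(601)]
output_pulse = convolve(input_pulse, response)

s1 = rms_width(input_pulse, dt)
s2 = rms_width(response, dt)
s3 = rms_width(output_pulse, dt)
print(s1, s2, s3, math.sqrt(s1 ** 2 + s2 ** 2))
```

The last two printed values should coincide to within rounding error, since the variances of the input pulse and the impulse response add under convolution.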

By using eq. (A5.1) in conjunction with the definitions for $\Delta f_{\mathrm{opt}}$ and
$\Delta f_{\mathrm{el}}$, it is possible to determine expressions for the two latter quantities
provided the function $h(t)$ is known. To give a relatively simple example of this type of
calculation, we may take a rectangular normalized impulse response (Fig. A5.1a). That is, we have

$$h(t) = \frac{1}{\Delta T} \qquad -\frac{\Delta T}{2} < t < \frac{\Delta T}{2}$$

and

$$h(t) = 0 \qquad t < -\frac{\Delta T}{2},\; t > \frac{\Delta T}{2}$$

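From eq. (A5.1)

$$H(f) = \frac{1}{\Delta T}\int_{-\Delta T/2}^{\Delta T/2}\exp(-i2\pi f t)\,dt = \frac{1}{i2\pi f\Delta T}\left[\exp\left(\frac{i2\pi f\Delta T}{2}\right) - \exp\left(-\frac{i2\pi f\Delta T}{2}\right)\right] = \frac{\sin(\pi f\Delta T)}{\pi f\Delta T}$$

Thus $|H(f)|$ falls to one-half of its maximum value (i.e. 1 at $f = 0$) when
$\pi f\Delta T = 1.895$, that is $\Delta f_{\mathrm{opt}} = 0.603/\Delta T$. Similarly,
$\Delta f_{\mathrm{el}} = 1.392/(\pi\Delta T)$ or $\Delta f_{\mathrm{el}} = 0.443/\Delta T$.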
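
The closed form for the rectangular response can be compared with a direct numerical evaluation of eq. (A5.1). The sketch below is a hedged illustration in Python: the midpoint-rule integrator, the step count and the unit pulse width are assumptions made for the example.

```python
import cmath
import math

def rect_response_numeric(f, width, steps=2000):
    """Midpoint-rule evaluation of eq. (A5.1) for h(t) = 1/width on |t| < width/2."""
    dt = width / steps
    total = 0j
    for k in range(steps):
        t = -width / 2.0 + (k + 0.5) * dt
        total += cmath.exp(-2j * math.pi * f * t) * dt / width
    return total

def rect_response_closed(f, width):
    """Closed form sin(pi f width) / (pi f width)."""
    x = math.pi * f * width
    return 1.0 if x == 0.0 else math.sin(x) / x

width = 1.0                # pulse width Delta T in arbitrary time units (assumed)
f_opt = 0.603 / width      # optical bandwidth quoted in the text
numeric = rect_response_numeric(f_opt, width)
closed = rect_response_closed(f_opt, width)
print(abs(numeric), closed)
```

Both values come out close to one-half, as expected at the edge of the optical bandwidth, and the imaginary part of the numerical result is negligible because $h(t)$ is symmetric about $t = 0$.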



FIG. A5.1 Three 'theoretical' system response functions: (a) rectangular, (b) Gaussian and
(c) exponential decay.



From eq. (A5.2) we also have

$$\sigma^2 = \frac{1}{\Delta T}\int_{-\Delta T/2}^{\Delta T/2} t^2\,dt$$

Thus $\sigma^2 = \Delta T^2/12$ and hence $\sigma = \Delta T/2\sqrt{3}$, and we can therefore
also write

$$\Delta f_{\mathrm{opt}} = 0.174/\sigma \qquad \text{and} \qquad \Delta f_{\mathrm{el}} = 0.128/\sigma$$

When we are dealing with digital signals, the bandwidth is referred to in terms of $B$, the bit
rate. We are then concerned with how close a succession of optical pulses can become before
they can no longer be resolved (see Fig. A5.2). It may be shown (ref. A5.1) that for a wide
range of pulse shapes $B$ should not exceed $1/4\sigma$. Thus we can write for rectangular pulses
that

$$\Delta f_{\mathrm{opt}} \approx 0.7B \qquad \text{and} \qquad \Delta f_{\mathrm{el}} \approx 0.5B$$

Of course, all the above relationships between $\Delta f_{\mathrm{opt}}$, $\Delta f_{\mathrm{el}}$,
$\sigma$ and $B$ are only valid for rectangular pulses. For other pulse shapes, the relationships
will be slightly different. A pulse shape that is often convenient to use is the Gaussian shape,
for which

$$h(t) = \frac{1}{\sigma\sqrt{2\pi}}\exp(-t^2/2\sigma^2)$$



This is illustrated in Fig. A5.1(b). We may proceed as for rectangular pulses, although the
integrals involved are more difficult to evaluate, and we merely quote the results:

$$H(f) = \exp(-2\pi^2 f^2\sigma^2)$$

and hence

$$\Delta f_{\mathrm{opt}} = 0.187/\sigma$$
$$\Delta f_{\mathrm{el}} = 0.132/\sigma$$

Since we do not often know exactly what pulse shapes we are dealing with, it is convenient
to use the following approximations:

$$\Delta f_{\mathrm{opt}} \approx \sqrt{2}\,\Delta f_{\mathrm{el}}$$
$$B = 2\Delta f_{\mathrm{el}}$$
$$B = \frac{1}{4\sigma}$$
$$\Delta f_{\mathrm{el}} = \frac{1}{8\sigma}$$

FIG. A5.2 Two overlapping pulses giving rise to a resultant signal that may be difficult to
resolve into individual pulses if these are too close together.


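Finally, another system response often encountered is that of exponential decay (Fig. A5.1c).
In this case we may write

$$h(t) = \frac{1}{\tau}\exp\left(-\frac{t}{\tau}\right) \quad t \ge 0, \qquad h(t) = 0 \quad t < 0$$

From eq. (A5.1)

$$H(f) = \frac{1}{\tau}\int_0^\infty \exp\left(-\frac{t}{\tau}\right)\exp(-i2\pi f t)\,dt = \frac{1}{\tau}\int_0^\infty \exp\left[-t\left(\frac{1}{\tau} + i2\pi f\right)\right]dt = \frac{1}{1 + i2\pi f\tau}$$

whence

$$|H(f)| = \frac{1}{(1 + 4\pi^2 f^2\tau^2)^{1/2}}$$

and

$$\Delta f_{\mathrm{opt}} = 0.276/\tau$$
$$\Delta f_{\mathrm{el}} = 0.159/\tau \tag{A5.4}$$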
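
The numerical factors quoted in this appendix for the rectangular, Gaussian and exponential responses can be reproduced by solving $|H(f)| = 1/2$ and $|H(f)| = 1/\sqrt{2}$ directly. The following is a minimal Python sketch using bisection; the helper names, the unit values of $\Delta T$, $\sigma$ and $\tau$, and the search interval are assumptions made for the example.

```python
import math

def h_rect(f, width=1.0):
    """|H(f)| for a rectangular impulse response of width Delta T."""
    x = math.pi * f * width
    return 1.0 if x == 0.0 else abs(math.sin(x) / x)

def h_gauss(f, sigma=1.0):
    """|H(f)| for a Gaussian impulse response of r.m.s. width sigma."""
    return math.exp(-2.0 * math.pi ** 2 * f ** 2 * sigma ** 2)

def h_exp(f, tau=1.0):
    """|H(f)| for an exponentially decaying impulse response."""
    return 1.0 / math.sqrt(1.0 + 4.0 * math.pi ** 2 * f ** 2 * tau ** 2)

def bandwidth(response, level, f_hi=1.0):
    """Frequency in (0, f_hi) where response(f) == level, found by bisection.

    Assumes response falls monotonically from 1 at f = 0 to below level at f_hi.
    """
    lo, hi = 0.0, f_hi
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if response(mid) > level:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

results = {}
for name, response in (("rectangular", h_rect), ("gaussian", h_gauss), ("exponential", h_exp)):
    f_opt = bandwidth(response, 0.5)
    f_el = bandwidth(response, 1.0 / math.sqrt(2.0))
    results[name] = (f_opt, f_el)
    print(f"{name:12s} opt = {f_opt:.3f}  el = {f_el:.3f}  ratio = {f_opt / f_el:.2f}")
```

The products are expressed in units of $1/\Delta T$, $1/\sigma$ and $1/\tau$ respectively. The ratio of optical to electrical bandwidth is close to $\sqrt{2}$ for the Gaussian response and differs somewhat for the other two shapes.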
Physical constants and properties of some common semiconductors at room temperature (300 K)



Rest mass of electron            $m = 9.11 \times 10^{-31}$ kg = 0.000 549 amu
Charge of electron               $e = 1.602 \times 10^{-19}$ C
Electron charge/mass ratio       $e/m = 1.759 \times 10^{11}$ C kg$^{-1}$
Avogadro's constant              $N_A = 6.022 \times 10^{23}$ mol$^{-1}$
Planck's constant                $h = 6.626 \times 10^{-34}$ J s
                                 $\hbar = h/2\pi = 1.055 \times 10^{-34}$ J s
Boltzmann's constant             $k = 1.381 \times 10^{-23}$ J K$^{-1}$
Speed of light (in vacuum)       $c = 2.998 \times 10^{8}$ m s$^{-1}$
Permittivity of a vacuum         $\varepsilon_0 = 8.854 \times 10^{-12}$ F m$^{-1}$
Permeability of a vacuum         $\mu_0 = 4\pi \times 10^{-7} = 1.257 \times 10^{-6}$ H m$^{-1}$
Stefan-Boltzmann constant        $\sigma = 5.670 \times 10^{-8}$ W m$^{-2}$ K$^{-4}$



Property                                                     Si                        Ge                        GaAs
Atomic (molecular) weight                                    28.09                     72.60                     144.6
Energy gap, $E_g$ (eV)                                       1.12                      0.67                      1.43
Intrinsic carrier concentration, $n_i$ (m$^{-3}$)            $1.5 \times 10^{16}$      $2.4 \times 10^{19}$      $1 \times 10^{13}$
Electron mobility, $\mu_e$ (m$^2$ V$^{-1}$ s$^{-1}$)         0.135                     0.39                      0.85
Hole mobility, $\mu_h$ (m$^2$ V$^{-1}$ s$^{-1}$)             0.048                     0.19                      0.045
Relative permittivity, $\varepsilon_r$                       11.8                      16.0                      †
Electron effective mass*, $m_e^*$ ($\times m$)               0.12                      0.26                      0.068
Hole effective mass*, $m_h^*$ ($\times m$)                   0.38                      0.23                      0.56
Recombination constant, $B$ (m$^3$ s$^{-1}$)                 $1.79 \times 10^{-21}$    $5.25 \times 10^{-20}$    $7.21 \times 10^{-16}$

† The value of $\varepsilon_r$ of GaAs quoted in the literature varies from 10.7 to 13.6.

* Several effective masses may be defined, namely the density of states and conductivity effective
masses; the values given here are representative of those quoted for the conductivity effective mass.
Laser safety



Most practical lasers emit radiation that is potentially hazardous. The degree of hazard is 
related to the output characteristics of the laser, the way in which it is used, and the 
experience of the operator. The types of laser and the ways in which they are used are, as 
we have seen, many and varied. It would therefore be unreasonable to suggest that all laser 
applications require the same degree of control. Accordingly, systems of classification of 
lasers have been introduced, which are given below. 

Any classification must be operable and consistent with our knowledge concerning the 
injuries that laser radiation can cause. The mechanism by which laser radiation causes damage 
is similar for all biological systems and may involve thermal, thermoacoustic and photo- 
chemical processes. The degree to which any of these mechanisms is responsible for damage 
depends on the parameters of the laser source, such as wavelength, pulse duration, image 
size and power, and energy density. 

One of the principal characteristics of laser radiation is its beam collimation. This, together 
with a high energy content, can result in large amounts of energy being transmitted to bio- 
logical tissues. The primary event causing damage is the absorption of the radiation by the 
biological system. Absorption occurs at an atomic or molecular level and is thus wavelength 
specific. Thus it is primarily the laser wavelength which determines which tissue is liable 
to be damaged. Absorption of radiation leads to a rise in temperature which may then disrupt 
molecular bonds and hence impair the function of the molecule. The rise in temperature is
related to the length of exposure and the irradiance of the beam, and this leads to the concept
of a radiant exposure, or dose, measured in joules per square metre. Exposure to Q-switched
or mode-locked lasers can cause a very rapid rise in temperature of the tissue, so that liquid
components may be converted to gas leading to a rupturing of cells. In general the
interrelationships between damage mechanisms and exposure are very complex (ref. A7.1) and
we shall not pursue them here. The maximum permissible exposure (MPE) levels at which
the human eye will not suffer adverse effects from exposure to laser radiation under given
conditions are given in ref. A7.2a, which gives guidelines and recommendations for the UK
and rest of Europe.

Probably the most vulnerable part of the body as far as laser radiation is concerned is the
eye. This is mainly because the eye lens will focus incident collimated laser radiation down to a
small point with a radius of the order of the wavelength, and with a correspondingly high energy
density. The hazards are wavelength dependent so that radiation in the ultraviolet and
far-infrared regions, which is absorbed by the cornea, represents a danger to the cornea, whereas
radiation in the visible and near-infrared represents a retinal hazard. The increase in irradiance from
the cornea to the retina is given approximately by the ratio of the pupil area to that of the 
image formed on the retina. Typically, the pupil may expand to a diameter of 5 mm with a
corresponding retinal image of 10-20 µm diameter. Thus the irradiance increases by a factor
of between $2 \times 10^5$ and $5 \times 10^5$. A laser beam of 50 W m$^{-2}$ incident on the cornea then
represents an irradiance of about 25 MW m$^{-2}$ on the retina. About 5% of the radiation falling
on the retina is absorbed in the pigments within the rods and cones, and this may be suffi- 
cient to cause a burn and a loss of vision. 
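
The arithmetic in this paragraph can be sketched in a few lines of Python. This is an illustration only and is no substitute for the exposure limits in the cited standards. The function names are hypothetical, the gain is estimated as the simple ratio of pupil area to retinal image area as described above, and the value of $5 \times 10^5$ is the gain implied by the 50 W m$^{-2}$ to 25 MW m$^{-2}$ example.

```python
import math

def area_ratio(pupil_diameter_m, image_diameter_m):
    """Ratio of pupil area to retinal image area for circular apertures."""
    return (pupil_diameter_m / image_diameter_m) ** 2

def retinal_irradiance(corneal_irradiance_w_m2, gain):
    """Irradiance at the retina for a given corneal irradiance and optical gain."""
    return corneal_irradiance_w_m2 * gain

gain_10um = area_ratio(5e-3, 10e-6)       # 5 mm pupil, 10 um retinal image
retina = retinal_irradiance(50.0, 5e5)    # 50 W m^-2 at the cornea, gain of 5e5
print(f"area ratio for a 10 um image: {gain_10um:.3g}")
print(f"retinal irradiance: {retina / 1e6:.0f} MW m^-2")
```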

In general terms, skin can tolerate a great deal more exposure than can the eye, although 
blisters, ulceration and scarring of the skin may occur for high levels of irradiation. Again, 
the hazards are wavelength dependent, being more pronounced for ultraviolet radiation. Any 
organization which uses lasers would be advised to draw up a safety code of practice, which 
should be based on an accepted classification of lasers. The classification accepted in the 
UK and Europe is described below. There are slight differences in the classification used 
in the United States. These classifications and related information on laser safety are given 
in considerable detail in the various publications cited in ref. A7.2. 



A7.1 Laser classification based on BS 4803

Class 1 

Output power is so low as to be inherently safe. 

Class 2 

Such lasers operate in the visible part of the spectrum (400 nm to 700 nm) and their output 
power is limited to 1 mW for continuous wave (CW) operation. Such lasers are not inher- 
ently safe, but some eye protection is afforded by the natural aversion response of the eye, 
including the blink reflex. Hazards can be controlled by relatively simple procedures. 

Class 3A 

These lasers operate in the visible part of the spectrum (400 nm to 700 nm) and their output 
is limited to 5 mW for CW operation. Some protection is still provided by the aversion 
responses. Direct intrabeam viewing with optical aids may be hazardous. 

Class 3B 

These lasers operate in any part of the electromagnetic spectrum between wavelengths of 
200 nm and 1 mm. Their output power is limited to 500 mW for CW operation. Direct beam
viewing could be hazardous and must be avoided. Likewise specular reflections may be 
hazardous but diffuse reflections will not generally be so. Under no circumstances should 
the beam be viewed with optical aids. More detailed control measures are necessary. 




Class 4 

These lasers also operate within the wavelength range 200 nm to 1 mm and their output power
exceeds 500 mW. Not only is viewing of the direct beam and specular reflections hazardous
but, in some cases, viewing of diffuse reflections may also be hazardous to the eye. In addition,
there is also a risk of skin burns from the direct beam and from first-order specular reflections.
The beam from such lasers is also capable of igniting flammable material so that care
must be taken to minimize the risk of fire. The use of Class 4 lasers requires extreme caution
for safety both of the user and of other persons who may be present. If possible the system
should be totally enclosed.



A7.2 Safety note

It is stressed that the above classification is intended as a guide only and that laser users should
consult the publications given in ref. A7.2. Particular attention is drawn to the American
'Standards for the safe use of lasers', Number Z136, issued by the American National Standards
Institute (ref. A7.2b).

Each laser device should carry an appropriate label and notice of warning (ref. A7.2a,
Part 2, p. 3).

The safe use of lasers often involves the provision of safety interlocks and warning lights 
on access doors to rooms where lasers are being used, together with beam stops and enclosures.
Materials which act as diffuse reflectors should be used wherever possible. Protective
eyewear should also be available for use with Class 3B and Class 4 lasers. Bearing in mind
that the protective medium will not afford protection over the whole spectral range, care must 
be taken to ensure that the eye protection has the correct spectral response to match the laser 
being used. 

Finally, it should be remembered that most lasers are high voltage devices (several kilovolts)
often capable of delivering large currents and therefore care must also be taken in this
respect.
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aberrations (thin lens), 25 
absorption 
coefficient. 173 

effects on detector performance. 

318 
variation in semiconductors, 316 
loss in fibers, 395 -7 
of radiation, 173-5 
acceptors, 55 

acting a.s traps, 130 
acoustic phonon, 501 
acousto-optic 

beam deflection, 480, 484 
effect, 112-16 
modulation, 1 16 
activators, 129 
active displays. 129 
active mode locking, 252-4 
active phase tracking, 504-6 
active region, 205 
alexandrite laser, 202-3 
amplifier front end design, 454-6 
high impedance. 454 
low impedance, 454 
transimpedance, 454 - 6 
amplitude shift keying (ASK). 435 
analog modulation, 429-32 
anamorphic lens, 445 
anisotropic material, 527 
anisotype junction, 80 
antireflection coating 
fiber bundles, 521 
Schottky photodiodes, 335 
silicon photodiodes, 329 
ami-Stokes shift, 501 
argon ion laser, 226 - 8 
array 
CCD 2D, 347 
detector, 344-8 
laser, 219-20 
phased, 220 
scanning, 164-5,344 
VCSEL.221-2 
articulated arm, 280 
aspect ratio, 285 
atmospheric 
turbulence. 436 



window, 436 
attenuation in optical fibers. 388, 395 

measurement. 407-8 
avalanche 

breakdown. 79 
photodiode, 337-40 

amplification process, 337-8 
in communication systems, 438, 

462 
inOTDR,412 
noise, 340- 1 
reach-through, 339 
separate absorption and 

multiplication (SAM), 340 
structure. 338-9 
axial modes, 190-3 

backscatter (fibers), 4 1 1 

balanced receiver configuration, 469 

486 
band bending. NEA surfaces, 304 
bandgap see energy gap 
bandgap wavelength, 86, 1 17, 314 
bandwidth 

electrical, 541 

optical, 541 
beam 

coherence, 260-4 

collimation. 259 

divergence, 258-60 
table, 259 

modulation telemetry. 269-71 
beam splitter (fiber). 5 10, 5 1 8 
beat length, 418, 513 
beta-barium borate, 237 
birefringence 

fibers, 418, 512-13 

induced, 532-4 

natural, 527-32 
birefringent filter, 204 
bistable optical devices, 482-4 

logic gates. 482 

optical computer, 482 

R-SEED devices, 482 

S-SEED devices, 483 

single rail logic, 484 
bit, 433 



error rate. 434 -5. 452, 464 
btackb<x!y radiation. 28-30 
Planck's law, 30 
Wien's law. 30 
Bohr model. 31-2 
boiling point, 281 
bolometer, 298-31X). 348 
Bolizmann statistics. 175-7 
Bragg 

acousto-optic grating. 114-16 
fiber grating, 420-1. 467. 484, 
498-9 
as sensor. 498-9 
hydrogen loading. 420 - 1 
notch filter. 467 
phase grating, 421 
strain sensitivity, 498 
grating reflector, 480. 486 
Brewster angle. 8 
Brewster law. 8 
brightness 

displays. 157-8, 165 
laser radiation, 264-5 
LEDs, 157-8 
Bnllouin scattering. 468. 501 
Brillouin zone. 45 
Bumis emitter. 444 

cadmium selenide (CdSe) 

photodetectors. 319 
cadmium sulfide (CdS) photodetectors 

319 
cadmium telluride, 107, 354 
capacitance, junction, 75-8 

effect on LED response time, 152 
effect on photodiode response time 
333 
capacitor. MOS, 344 
carbon dioxide laser, 229-32 
gas dynamic laser, 232 
gas flow laser, 232 
materials processing. 279. 281-3 
rotational- vibrational energy levels 

229-30 
sealed tube laser, 23 1 
surgery, 418 
TEA laser, 232 
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carrier 

concentrations. 57-62 

confinement. 81 

lifetime. 152-3 
cathode ray tube (CRT). 134-7 

electron gun, 135 

faceplate, 136 

raster canning, 135-6 

screen, 135 

shadowmask, 136 

structure, 135 
cathodoluminescence, 133-7 

definition, 129 

energy transfer mechanism, 134 

use in CRT screen. 135 
cavity 

lifetime, 256 

optical, 179-80 
chalcogenide glasses. 4 1 8 
characteristic luminescence, 130 
charge 

carriers, 47 

coupled device <CCD), 344-8 
arrays, 351 

polysilicon gate, 347-8 
2D array. 347 

storage capacitance, 77 

transfer efficiency, 346 
chemical vapour deposition 

axiaI(VAD),414 

modified (MCVD), 413 

outside (OVD), 414 
chirped diffraction grating, 480 
cholesteric liquid crystals, 159 
chromatic aberration, 25 
cladding 

elliptical, 420 

mode, 377, 406 

pumping, 202 

waveguides. 359. 365, 474 
cleaved coupled cavity, 249 
co-activators, 130 
coded mark inversion, 435 
coherence, 260-4 

length, 262. 510 

in frequency doubling, 121 

of laser light, 260-4 

spatial, 262 

temporal, 262 

time. 263 
coherent fiber bundles, 5 19-21 
coherent optical fiber communication 
systems, 464, 468-70 

balanced receiver configuration, 469 

polarization diversity, 469 

polarization scrambling, 469 
common emitter gain, phototranssstor, 

342 
compact disc. 487 
compound lens, 26 
conduction band, 43, 47-8 
cone (eye), 548 



contact potential, 69 
conservation of wavevector in 

interband transitions, 142-3 
copper indium diselenide, 354 
core 

elliptical fibers, 419. 513 

waveguides, 359, 365, 474 
comea. 547 

sculpting, 286 
comer cube reflector. 269 
coupled mode theory, 405, 460. 478 
couplers 

multifiber. 403-5 

single mode. 405-6 
cross-talk (fibers), 519-20 
crystal 

class, 529 

field, 197,200 
Curie temperature. 107-8, 301 
cut-off frequency in photodetectors. 

294.301 
cut-off wavelength (optical fibers), 
409-10 

D\ 294-5, 344 
dark-line defects, 217 
de Broglie equation, 37 
decision level, 434 
decision time, 434, 464 
degeneracy, 172, 240 
degradation (lasers). 217- 18 
density of states function. 57 

quantum well, 85 
depletion layer 

capacitance, 75-7 

width. 328 
depletion region, 69, 81, 325, 333 
depressed cladding fibers, 392 
difference frequency generation. 125 
diffraction, 480 

grating, 467 

grating (chirped). 480 

limited. 26 
diffusion 

capacitance, 77, 152 

coefficient, 65 

effect on LED response, 152-3 

Ienglh, 6.5, 205 

of carriers, 64-6 

in relation to photodiode 
response. 3.34 

potential, 69 
digital modulation, 432-6 

fundamental limitations on signal 
size, 45 I -4 

noise. 433. 452 
dipoles (electric), 301,527 
direct bandgap, 46. 142-3, 205 
direct detection, 429, 452, 470 
directionality of laser radiation, 

258-60 
director (liquid crystal), 159. 162 



dispersion 

compensation. 462 

flattened fibers. 392, 462 

mtermodal. 378-82, 383-4, 457 

material, 389-91 

measurement, 408-9 

profile, 389. 39 1 

shifted. 462 

waveguide. 389, 391 
displays 

aclive, 129 

brightness. 157-8 

passive. 129. 158 

seven bar segment, 163 

7x5 matrix, 16.3 
distance measurement. 267-74 

beam modulation telemetry, 269-70 

interferometric methods, 267-8 

pulse echo technique, 271 
distributed feedback laser, 221, 469 
document scanner, 518 
donors. 53-4 

acting as traps. 130 
doped insulator laser, 196-204 

diode pumped, 200-2 
doped semiconductor photodetector, 

321 
doping, 53-6 
Doppler 

broadening, 184 

effect. 1 16.246 
double refraction .see birefringence 
drift (charge carriers), 50, 69 
dye (saturable), 253-4 
dye lasers. 233-6 

effective density of states. 60-1 
effective mass. 50. 56 
eigenfunctions, 41 
eigenvalues. 41, 42 
Einstein. 2 

coefficient. 171, 188 

relations. 172 
lasers. 171-2 
semiconductors. 65 
electric 

dipoles. 301,527 

displacement, 527 

permittivity tensor. 528 

polarization. 119-20,527-8 

susceptibility, 527 
tensor. 528 
electrical 

bandwidth. 541 

conductivity. 48-5 I 
electroluminescence, 129 

ax. powder device, 138-41 

a,c. thin film device, 138, 141 

classical. 129 

d.c. powder device, 139-40 

dc thin film device. 138 

definition. 129 
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injection. 138, 141-2 

emission mechanisms. 1,19-40 
electromagnetic spectrum. 2 
electro-optic 
coefficients. 96-7 
effect. 96-107 
modulator, 477-8 
longitudinal. 103 
transverse, 103-4, 47h-7 
electron 
accumulation. 80 
affinity. 62. NO. 82. 304 
beam range, 1 34 
beam pumped lasers. 233 
gun, ] 3.5 
electron-hole pairs, 47 
emission <of radiation) 
spontaneous. 170-1 
stimulated. 170-1 
endoscope, 286.519 
energy 

bands. 42-8 
gap, 43, N4 
energy-level systems (in lasers) 
four level, 177-8. 187. 198. 203 
three level, 178.458 
two level, 177, 200 
evanescent field, 364 
excess carriers. 63-4 
excess loss (fiber couplers), 403 
excess noise factor, 340 
excimer laser, 233 

materials processing, 279. 282 
ophthalmology. 286 
excitons, 56-7, 1 17 
bound. 145 

in cathodoluminescence, 134 
in LEDs, 145 
exponential decay, 294, 545 
external quantum efficiency. 151 
extra mural absorption (e.m.a.). 521 
extraordinary ray. 93-4, 123 
extraordinary refractive index, 94 
extrinsic semiconductors, 53-6 
eye diagram. 463-4 

Fabry-Perot 
cavity, 125 

interferometer, 18- 19 
resonator, 179, 192. 244 
Faraday- 
effect. 110-12 
rotation. 513 
Fermi-Dirae function. 58 
Fermi energy level. 58. 62. 69, 80, 81, 

83 
fiber 
acceptance angle, 376, 441 
bundles. 516-21 
arrays, 5 1 9 
beam splitters, 518 
coherent, 286. 519-21 



core packing fraction, 5 If) 
cross-talk. 519 
endoscope. 286. 519 
fiberscope. 519 
in medicine. 286 
matrix display. 518 
inner. 5 IN 
rigid. 519-21 
Y-guide. 518 
cables. 421-3 
couplers. 412- 16 
dispersion measurement, 408 9 
jointing. 397 402 
losses, 393-7 
absorption. 397 
bending. 393-5 
in plastic liber. 416 
jointing. 397-402 
microbendiiig. 394 
mid-infrared fibers, 417 
scattering. 359 
manufacture. 413 to 
fiberscope. 5 19 
filaments in lasers, 21 1 
llicker noise. 319 
fluorescence. 1 29 
fluorescent dyes. 234 
fluorescent lamp. 133 
fluoroulkvl inettiacrylale. 415 
FItioroptic sensor. 496 
flux budget calculation. 462 
focal length. 25 
focal point. 24 

focusing of laser radiation. 265-7 
foot-Lambert, 158 
Fotome sensor, 493 
Fourier transform, 541 
frame transfer. 346 
Fraunhofer diffraction. 21-2 
free electron laser. 238-9 
free space communications, 436-8 
frequency 

doubling .vrr second-harmonic 

generation 
shift keying (FSK), 436. 468 
stabilisation in lasers. 245-9 
Fresnel 
diffraction. 21 
loss at fiber joints. 398 
reflection. 41 1, 521 
Fresnel -Kirchhoff formula. 21 
Fresnel 's equations. 361 
fused biconic taper coupler. 405 

use as wavelength mixer. 460- 1 
fused plate. 520- 1 
fusion Oaser induced), 286-8 
fusion splicing. 397 

Gabor, 271 
gam 

bandwidth product (GBP). 505 

coefficient, 175 



nu\r. 192, 24 S 
saturation. 182. 468 
gallium aluminium arsenide (( iuAIAo 
IictetojiiiK'tion. Nil 
IO waveguides. 475 
laseis. 208. 21 I I5.4S2 
M'IK. 148 
quantum wells. S 5 
VC.Shl.s. 222 
gallium aisemdc K iaAs) 
fiber scusois. 295 
lietcio|iiiiction. 8(1 
induced huetititgencc. S*2 4 
uiteiisilted photodiode. U4 
IO devices. 486 
IO waveguides, 4 7S 
laseis. 20S 7.211 
I.lilK 147 
metal semiconductor metal 

photodiode. "7. 484 
pfiotocathodes. M ( 
I'ockelx modulators. 107 
quantum wells. 8! 
Scruittk) photodiode, 06 
solar cell. 154 
V( "SI-I..X. 222 
gallium arsenide phosphide (< laAsd 
intensified photodiode. '44 
lasers. 215. 216 
I.HDs. 148 

Schottlvy photodiode. H(, 
gallium tmlmnt aisenide it ialn.As! 
API). 340 
IO devices. 486 
photoconductoi. '20 
photodiodes. 331 2 
gallium indium arsenide phosphide 
(GalnAsP) 
API). 457 
IO devices. 486 
lasers. 216 17 
photodiodes. 331 
gallium nitride. 222 
gallium phosphide (GaP). 147. 336 
gas lasers. 223 33 
argon ion. 226 7 
GO.. 229 32 
excimer. 233 
He -Ne. 223 6 
metal vapour. 228 
nitrogen. 233 
gate (MOS device). 344 
Gaussian 

frequency distribution. 185 
mode, 193-5. 3N7. 446 
pulse. 408.544 
geodesic tens. 484 

generation recombination noise. 318 
germanium 
APD. 340.456 
photodiode. 3 i 1 
glass fiber, 516 
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amplifier, 202 

laser, 201-2
glaucoma, 286
Golay cell, 300
graded index fibers, 382-5
intermodal dispersion, 383-5
light coupling from LED, 443
modes, 384-5
ray paths, 384-5
refractive index profile, 382
GRIN lens, 26-8, 444
group
index, 389
transit time, 409
velocity, 4-5, 389
guard ring (APD), 338
gyro (fiber), 508-11

half-wave voltage, 100-3 

heat-affected zone, 283 

He-Ne laser, 223-6
Heisenberg's uncertainty principle, 38, 144
helical ray, 385
heterodyne detection, 429-30, 468
balanced receiver, 486
heterojunction, 79-81, 331
lasers, 211-18, 348
stripe geometry, 211-14, 217
high birefringence fiber, 419-20
hole burning, 245-6
holes, 47
hologram, 271
holographic interferometry, 274-6
holographic optical memories, 276-8
holography, 271-4
homeotropic ordering, 159
homodyne detection, 431, 468
homogeneous broadening, 185, 245
homogeneous ordering, 159
homojunctions, 67-70, 331
lasers, 204-7
Huygens' construction, 94
Huygens' principle, 19
hydrogen loading, 421 

ideality factor, 79 
idler wave, 237 
image 
conduit, 520 
contrast, 521
intensifiers, 312-14, 521
first-generation types, 312-13
second-generation types, 313
third-generation types, 313-14
impurity centre recombination, 144-5
index
ellipsoid, 93, 529-30
matching, 122-3, 125
indirect bandgap, 142-3
indium antimonide (InSb)
photoconductor, 321



indium gallium arsenide phosphide (InGaAsP) see gallium indium arsenide phosphide
indium gallium nitride (InGaN), 222
induced birefringence, 532-4
inhomogeneous broadening, 185-6, 245-6
injection luminescence, 141-2 
insertion loss, 402, 404 
integrated optics, 472-87 
beam deflectors, 480 
bistable devices, 482-4 
couplers, 478-80 
detectors, 481, 484-6
emitters, 481-2
filters, 480
modulators, 478-80
phase shifter, 476-8
quantum well modulator, 118
spectrum analyzer, 484
switch, 477-80
waveguides, 472-5
field distributions, 474-5
losses, 475
waveguide splitter, 475-6
intensified photodiode, 343-4
interband transition, 142-4
interference, 14-19
fringes, 13, 274-5
multiple beam, 19
thin film, 16-17
two beam, 14-17
interferometer
Mach-Zehnder, 503-4
Michelson, 268-9
interferometry (holographic), 274-6
interline transfer, 346
intermodal
dispersion, 377-82, 383-5, 457
frequency separation, 190-1, 244
internal quantum efficiency, 151, 210
intrinsic semiconductors, 51-3
ion implantation, 83
ion lasers, 226-8 
isoelectronic trap, 148 
isotropic material, 527 
isotype junction, 80 

Johnson noise, 311, 448-50, 454, 456
junction
anisotype, 80
capacitance, 77
geometry, 75
heterojunction, 79-81, 331
homojunction, 67-70, 331
isotype, 80
metal-semiconductor, 81-3, 334
metal-semiconductor-metal, 336, 484
ohmic, 80, 83
p-i-n, 329-30
p-n, 67-79
rectifying, 80

kerf, 283
Kerr
constant, 97
table, 107
effect, 96
magnetic, 112
optical, 108, 470
modulators, 107-8
keyhole (laser welding), 282
kinks (laser output), 211, 447
krypton fluoride excimer laser, 233
Kynar, 422

Lamb dip, 247 
Lambertian emission, 157 
coupling to fibers, 442 
display brightness, 157 
source, 410, 516
laser
arrays, 219-20
beam delivery, 279-80, 418
cutting, 278, 283
heat-affected zone, 283
kerf, 283
drilling, 278, 284
aspect ratio, 285
excimer laser, 285
ruby laser, 284
taper, 285
induced fusion, 286-8
light (properties of), 258-67
brightness, 264-5
coherence, 260-4
directionality, 258-60
focusing properties, 265-7
tunability, 236, 267
losses, 181
marking, 278, 284
materials processing, 279
medical applications, 285-6
mode
frequency separation, 190-1
volume, 179-80, 193-4, 207
modes
axial, 190-3
transverse, 193-5
safety, 547-50
scribing, 284
vertical cavity surface emitting (VCSEL), 220-2
welding, 278, 281-2
shielding gas, 281
keyhole formation, 282
lasers
alexandrite, 202-3
cadmium ion, 228
diode pumped, 200-1
doped insulator, 196-204 
free electron, 238-9
gas, 223-33 
glass fiber, 201-2 
krypton fluoride, 122 
liquid dye, 233-6
metal vapour ion, 228-9
Nd-glass, 199
Nd-YAG, 196-9, 438
parametric, 236-8
quantum well, 218-19
ruby, 199-200
semiconductors, 204-22, 248-9
see also semiconductor lasers
short wavelength, 222
tunable, 202, 238
vibronic, 202-4
xenon fluoride, 233
latent heats, 281
lattice matching, 222, 331
lead sulfide (PbS) photoconductor, 319-20
lead zirconate, 301
leaky mode, 374, 406, 410
lens
anamorphic, 445
compound, 26
geodesic, 484
GRIN, 26-8, 444
sphere, 26, 444, 445
thin spherical, 24-6
lidar, 271
lifetime, 170

light-emitting diode (LED), 141-55
communication systems, 428, 438, 441, 461, 463
construction, 149-52
drive circuits, 153-5
edge emitting, 81, 444
emission linewidths, 146
impurity centre recombination, 144-5
injection luminescence, 141-6
materials for, 147-9
quantum well, 86
recombination processes in, 142-6
response times of, 152-3
surface emitting, 81
superluminescent, 222-3, 510-11
line coding, 435
linearly polarized light, 7-11
linearly polarized mode, 374, 385
lineshape function, 183, 537
linewidth, 260
liquid crystal 
displays, 158-63 

cell construction, 160-3
effect of applied voltage, 161
matrix, 487
supertwist type, 163
twisted nematic type, 160-3
light valve, 348-51
accumulation phase, 351
depletion phase, 350
hybrid field effect mode, 349
large screen projection, 351
phase epitaxy, 475
pitch, 159
state, 159
lithium niobate (LiNbO3)
holography, 278
IO devices, 475, 477-8, 481
parametric laser, 237
parametric oscillation, 124
Pockels cell, 107, 257
lithium tantalate, 107
local area network (LAN), 465-6
components, 466
topologies, 465
logic gate, 482
Lorentzian lineshape, 185
loss coefficient (lasers), 181-2, 209
luminescence, 129-31
characteristic, 130, 134, 137
injection, 141
non-characteristic, 130, 134, 137
lifetime, 130

Mach-Zehnder interferometer, 503-5
machine vision, 281
magnetic radiation, 239
magneto-optic devices, 110-12
magnetostrictive materials, 508
magnification
lens, 25
mirror, 24
Maiman, 195
majority charge carrier, 53
material dispersion, 389-91, 462
in SiO2, 390
materials processing (laser), 281-5
matrix displays, 163-6
brightness, 165
fiber bundles, 518
ideal element/characteristic, 165

scanning, 166

suitability of electroluminescent 
devices, 141 

use of liquid crystals in, 165-6 

wiring methods, 164 
Maxwell 

electromagnetic theory, 529
equations, 360
medical applications (lasers), 285-6
cancer, 286
corneal sculpting, 286
dentistry, 286
glaucoma, 286
hypermetropia, 286
myopia, 286
ophthalmology, 286
optical fibers, 286
photodynamic therapy, 286
photorefractive keratectomy, 286
plaque removal, 286

surgery, 285-6, 418
mercury cadmium telluride, 321, 123

meridional ray. 374, 377. !N4, S 
MESFET amplifier, 486
metal oxide semiconductor (MOS)
capacitor, 344
metal-semiconductor junction, 81-3
metal-semiconductor-metal (MSM)
photodiode, 336-7
metal vapour ion laser, 228-9
metastable state, 129, 177
metrology, 267-71
Michelson interferometer, 267-8
microbends
effect on fiber losses, 394
effect on mode dispersion, 381
in cables, 422
sensors, 497
microbolometer, 348
microchannel plate, 313
microfilter (CRT), 137
mid-infrared fibers, 417-18
chalcogenide glasses, 418
ZBLAN, 417
Mie scattering, 436
minority charge carrier, 53
mirror (spherical), 23-4
multilayer, 221
bandpass, 467
phase conjugate, 280
mixer (fiber), 518
mixing rod, 403, 466, 512
mobility, 49
mobility. 49 
mode 
coupling, 381, 393
cut-off, 368, 378
wavelength, 387
dispersion, 378-80, 383-5
field diameter, 387
locking, 250-4
scrambler, 406
stripper, 406
volume, 180, 193-5, 207, 208
modes
fibers, 374-5, 385
cladding, 377, 406
electric field distribution, 375
leaky, 374, 406, 410
linearly polarized, 374, 385
sensors, 497
single mode, 385-7, 389, 446, 457
laser, 190-5
single mode operation, 244-5
planar waveguides, 368-73
modified chemical vapour deposition (MCVD), 413
modulation, 428-36
analog, 425-32
digital, 432-6
frequency, 431, 436, 438

lasers, 447
LEDs, 154, 445

pulse, 429 
molecular lasers, 229-33 

CO2, 229-32, 438
excimer, 233
gas dynamic, 232
nitrogen, 233
monochromatic aberration, 25
monofibers, 516
multilayer mirror, 221
multiple quantum well, 482
detector, 323-4
multiplication noise, 311
mutual coherence function, 261 
myopia, 286 

negative 

bandgap, 321 
crystals, 122, 532
electron affinity, 303-5
nematic liquid crystals, 159
neodymium lasers,
diode pumped, 200-1
glass, 199
materials processing, 282
YAG, 196-9, 438
materials processing, 279, 282
Q-switched, 237
nit, 157
nitrogen laser, 233
noise
APD receiver, 450-1
avalanche, 340-1
equivalent power (NEP), 294
excess noise factor (APD), 340-1, 450
flicker, 319
generation-recombination, 318-19
in digital systems, 451-4
Johnson, 311, 448, 456
multiplication, 311
photoconductors, 318-19
photomultiplier, 310-12
p-i-n receiver, 448-50
shot, 310
white, 294
non-characteristic luminescence, 130, 137

non-linear optics, 119-24 

frequency doubling see second-harmonic generation

parametric amplification, 124-6
non-return-to-zero (NRZ), 432 
normalization condition, 41 
normalized film thickness (V), 369
normalized impulse response, 541
numeric displays, 163-6
numerical aperture (NA), 376, 516 

object wave (holography), 273 
ohmic contacts, 83, 328, 354
ohmic junction, 80, 83
Ohm's law, 49
optic axis, 93, 95, 238, 529
optical
activity, 96-7
bandwidth, 541
cavity, 179-80, 237
computer, 482
confinement, 214
feedback, 179-81
fiber amplifiers, 457-61, 464
erbium in silica, 459, 469
gain profile, 459
praseodymium in ZBLAN, 461
pumping arrangements, 458-9
use in optical fiber communication systems, 464
fiber communication systems
advantages, 439
choice of fibers, 457
coherent systems, 464, 468-70
components, 461-5
detectors, 448, 456-7
direct detection, 470
dispersion compensation, 462
emitter design, 438, 440-7
eye diagram, 463-4
flux budget, 462
limitations on signal size, 451-4
solitons, 471
transatlantic links, 464-5
wavelength division multiplexing, 466-8
wavelength used, 440
fiber sensors, 492-515
Bragg grating, 498-9
Brillouin scattering, 501
displacement, 492-3
distributed, 500-3
extrinsic, 492-7
Fluoroptic, 496
Fotonic, 493-5
gyro, 511
intrinsic, 492, 497-500
magnetic field, 508
microbending, 497
pH, 500
polarimetric, 511-13
pressure, 495, 507
proximity, 519
Raman scattering, 501
Rayleigh scattering, 500
refractive index, 500
rotation, 508-11
single mode, 503-8
smart structures, 502
strain, 498-9, 513
temperature, 495, 496, 498-9, 500, 507-8, 513
fibers, 359-60, 373-7
all plastic, 415-16 
beam delivery, 280 
beat length, 418 



Bragg grating, 420-1
cables, 421-3
depressed cladding, 392
dispersion flattened, 392, 462
dispersion shifted, 462
elliptical cladding, 420
elliptical core, 419
fusion splicing, 397
graded index, 382-5
high birefringence (Hi-Bi), 419
in communication systems, 438-72
light coupling into, 441-7
mid-infrared, 280, 417-18
panda, 420
plastic-coated silica, 414-15
polarization maintaining, 418-20
refractive index profile measurement, 410
side pit, 420
single mode, 385-8
step index, 373
frequency division multiplexing see wavelength division multiplexing
indicatrix, 93, 529-30
parametric oscillator, 236
phonons, 501
resonator, 18
scanning, 108-10
time domain reflectometry (OTDR), 411-12, 500
ordinary ray, 93, 122
ordinary refractive index, 93, 122
output ratio, 403
outside vapour deposition (OVD), 414
overlap integral, 446

packing fraction (fibers), 516, 520

panda fiber, 420 

parametric 

amplification, 124
lasers, 236-8
oscillation, 236-8, 124-6
paraxial ray, 23
passive displays, 129, 158
passive mode locking, 253
passive Q-switching, 258
Pauli principle, 42, 57
penetration depth (electrons), 134
phase
conjugate mirror, 280
grating, 421
mask, 421
shift keying (PSK), 436, 468, 469
velocity, 4
phased arrays (lasers), 220
phonon
acoustic, 501
annihilation, 143

creation, 143
optical, 501
phosphorescence, 129 
phosphors, 129 
in colour TV, 136 
in CRT screen, 135
in electroluminescence, 138-9
photocathode, 303-5, 306, 312, 389
dark current, 310
materials, 303-5
photoconductive
detectors, 314-18
bias circuit, 314
idealized response, 318
noise, 318-19
gain, 317
photodetectors, 293 
see also photoconductive detectors, 
photodiodes, thermal detectors 
photodiode response times, 332-4
capacitance effects, 333
carrier diffusion, 334
carrier drift, 332-3
photodiodes, 325-41
avalanche see avalanche photodiode
germanium, 456
intensified, 343-4
metal-semiconductor-metal, 336-7, 484
interdigitated structure, 336
photoamperic mode, 325, 328
photoconductive mode, 325, 327
photovoltaic mode, 325, 326-7
p-i-n, 329-30
quantum well, 86
response time, 322-4
responsivity, 331
Schottky, 82, 334-6
silicon, 328-9, 438, 496
wavelength response, 330
photodynamic therapy, 286
photoelectric effect, 303
photographic emulsion, 278
photoluminescence, 131-3
definition, 129
KCl:Tl (Fig.), 133
Stokes shift, 132-3
temperature effects, 131
use in fluorescent lamps, 133
photomultiplier, 307-11, 448
dynode bias circuit, 307-8
dynode structure, 367
gain, 307
noise in, 310-11
photon counting, 352 
response time, 309 
secondary electron emission, 307 
photon, 2 
counting, 352 
devices, 293 
wavevector, 142 
photorefractive keratectomy, 286 
photorefractive layer, 278 



phototransistor, 341-2
p-i-n photodiode, 329-30
noise analysis, 448-50
use in communications, 438, 448
pitch
GRIN lens, 28
liquid crystal, 159
planar dielectric waveguide, 364-73
asymmetric, 472
condition for beam propagation, 367
modes, 368
strongly asymmetric, 473-4
field distribution, 474
Planck's blackbody radiation law, 30
plaque, 286
plasma displays, 155-7
plastic fibers, 415-16
attenuation, 416
materials, 415
Rayleigh scattering, 417
use in communications, 388
plumbicon, 322
p-n junction, 67-70
forward bias, 70-3, 141
reverse bias, 73-5, 325
pneumatic detector, 300
Pockels 

effect, 96-7, 532-4
modulators (or cell), 97-103, 257-8
IO version, 476-7
longitudinal, 103
transverse, 104
Poisson statistics, 452
Poisson's ratio, 498, 507
polarimetric fiber sensors, 511-13
polarization, 7
circular, 91
elliptical, 90-2
maintaining fiber, 418-20
plane, 7-11
polarizing beam splitter, 349
polymethyl methacrylate (PMMA), 415
population inversion, 175-7
in semiconductor lasers, 208-10
threshold, 186-9
positive crystals, 95, 532
positive liquid crystal material, 160
potential well, 39-42
Poynting vector, 34
principal
axis, 94, 528
permittivities, 528
refractive indices, 528, 529
section, 93
prism, 23, 467
profile dispersion, 389, 391
profile parameter, 382, 497
proximity sensor, 519
pulse modulation, 429
pumping, 176-8, 186-90
pupil (eye), 548 



pyroelectric detectors, 300-2
frequency response, 501

Q-switching, 254-8
electro-optic, 257-8
laser safety, 547
passive, 258
rotating mirror, 2VS o
quadrature, 504
quality factor, 1<>_'
quantum
confined Stark effect, 86, 117
dot, 219
efficiency, 141, 5 V>
external, 210
internal, 210
LED, 142, 14<> (table), Md
photocathode materials (Fig.), 305
photoconductors, \\1
photoemission, 306
mechanical tunnelling, 78, 139
number, 31, 42, 229-30
well, 83-6
detector, 332-4
modulator, 118-19
well laser, 218-19, 486
wire, 219
yield, 303
quarter wave plate, 95

radiation density, 30, 172
Raman scattering, 468, 501
Raman-Nath acousto-optic grating, 114-15
raster scan, 136
rate equations (lasers), 187
Rayleigh
criterion, 22
scattering, 395-6, 411, 416, 417, 500
erbium-doped fibers, 500
long wavelength fibers, 417
plastic fibers, 416
silica fibers, 396, 500
readout register, 346
real image (holography), 273
reciprocity, 509-10
recombination, 52
lifetime, 64
mechanisms, 52-3
effect on LED response, 152-3, 535-6
in semiconductors, 142-4
via excitons, 145
via impurities, 144-5
non-radiative, ^1
radiative, 52, 142-4
reference wave (holography), 273, 278
reflectance, 8
surface, 281
refractive index, 3, 528
extraordinary, 93, 529-30
ordinary, 93, 529
principal, 528
relative permittivity. 3, 54, 80, 100 
repeater, 438-9, 457 
resolving power, 22 
resonant transfer, 224 
resonator see optical cavity 
response time, 294 
responsivity, 293 
retina, 286, 548 
return-to-zero (RZ) signal, 432
reverse
bias breakdown, 78
leakage current, 331
saturation current, 141
Richardson-Dushman equation, 310
r.m.s. pulse width, 408-9, 541-2
rods (eye), 548
R-SEED device, 482
ruby laser, 199-200

Sagnac effect, 508 
sampling theorem, 433 
saturable absorber, 253-4, 258 
saturation current, 73 
scanning, 344 

Schottky metal contact, 486
Schottky photodiode, 82, 334-6, 344
Schrödinger equation, 39
second-harmonic generation, 119-24, 124
semiconductor lasers, 204-23
active regions, 205
arrays, 219-20
buried heterostructure, 213
dark-line defects, 217
degradation, 216-18
distributed feedback, 248, 482
GaAs, 205-7
heterojunction, 211-18
homojunction, 204-7
index guiding, 213
in optical communications, 438, 446-7, 463
kinks, 211, 447
power output, 210
quantum well, 86, 218-19, 486
quaternary compounds, 216
short wavelength, 222
strained lattice, 222
stripe geometry, 211, 217
ternary compounds, 211
threshold current density, 207-9
vertical cavity, 220-2
wavelength chirp, 464
sensitization centres, 319
sensors (optical fibers) see optical fiber sensors

seven-bar segment display, 163 
shadowmask, 136 
shielding gas (laser welding), 281
short wavelength semiconductor lasers, 221
shot noise,
avalanche photodiode, 340, 451
fiber interferometers, 508
photomultiplier, 310
p-i-n detector, 448-50
side pit fiber, 420
signal wave, 237
silica phase mask, 421
silicon-based waveguides, 486-7
silicon nitride, 222
silicon photodiode, 328-9
electric field distribution, 329
p-i-n, 330
p-n, 328
p-t-n, 330
responsivity, 331
structure, 328
use in communications, 438, 461
silicon solar cells, 354-5
single mode fiber, 385-8
couplers, 405-6
in communications, 457, 462
in sensors, 503-8
lasers, 244-5
mode field diameter, 387
single rail logic, 484
skew ray, 374, 385, 516
skin (laser safety), 548
slab waveguides, 472
small signal gain coefficient, 175
smart structures, 502
smectic (liquid crystal), 159
solar cell, 353-5
fill factor, 354
materials, 354-5
ohmic contacts, 354
silicon, 354-5
Soleil-Babinet compensator, 512
solitons, 470-1
specific heat, 281
speckle pattern, 264
spectral broadening, 183-6
collision, 185
Doppler, 184
homogeneous, 185
inhomogeneous, 185, 245
natural damping, 185 
spectral response 
eye, 32, 157 

photodetectors, 293, 318 
spectrum analyzer (IO), 484 
sphere lens, 26, 444, 445 
spherical aberration, 25 
spherical mirror, 23-4 
spiking oscillation, 199 
spontaneous emission, 170-1, 464
spontaneous transition lifetime, 170 
sputtering, 475 
S-SEED device, 483 
standing waves, 14, 190 
step index fibers, 373-82 

intermodal dispersion, 377-82, 457
leaky mode, 374
meridional ray, 374, 377

modes, 374-5 

numerical aperture, 376 

refractive index profile, 373 

skew rays, 374, 377 
stimulated 

Brillouin scattering, 468 

emission, 170-1, 502
Raman scattering, 468
Stokes shift, 131, 133, 501
strained layer, 222
stripe geometry, 211-13
stripe waveguides, 475 
sum frequency generation, 125 
superluminescence, 223 
superluminescent diode, 222-3, 510 
superposition principle, 11
superradiance, 223
surface reflectance, 281
surface states, 83
susceptibility (electric), 119, 527

taper, 285
telephone,
local networks, 465
trunk links, 439
television, 135-6
thermal detectors, 293, 296-302, 448
bolometer, 298, 300 
microbolometer, 348 
pyroelectric, 300-2 
structure, 297-8 
theoretical model, 296 
thermistor, 299 
thermoelectric, 298 
time constant, 297 
thermionic emission, 135, 310
thermistor, 299
thermoelectric detector, 298
thermoplastic photoconducting layer, 278
thin spherical lens, 24-6
threshold conditions, 181-2
current density, 207-10
pumping power, 186-9
temperature dependence, 214-15
time constant (detectors), 294
total
internal reflection, 10, 360-4
decay in second medium, 364
LEDs, 151
phase shift, 362 
transatlantic telephone links, 464-5 
transfer function, 408 
transitions 

atomic dipole, 130 
forbidden, 130 
transport register, 346 
transverse electric mode 
fibers, 374 
waveguides, 368
transverse electromagnetic mode (lasers), 193-5
transverse magnetic mode 
fibers, 374
waveguides, 368
transversely excited atmospheric lasers, 232
carbon dioxide, 232
nitrogen, 233
traps, 52
effect on luminescence, 130-1
effect on photoconductor response, 319
isoelectronic, 148
travelling wave modulator, 106
triglycine sulfate (TGS), 301
tuning (laser radiation), 235, 237-8, 267
tunnelling (quantum mechanical), in electroluminescence, 139
twisted nematic cell, 160-3
two-beam interference, 14-17
two-level binary modulation, 432 
Tyndall, 359 



V-groove, 400, 487